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MOLECULES FOR DISEASE DETECTION AND TREATMENT 

TECHNICAL FIELD 

The present invention relates to molecules for disease detection and treatment and to the use of 
5 these sequences in the diagnosis, study, prevention, and treatment of diseases associated with, as well as 
effects of exogenous compounds on, the expression of molecules for disease detection and treatment. 

BACKGROUND OF THE INVENTION 

The human genome is comprised of thousands of genes, many encoding gene products that 

1 o function in the maintenance and growth of the various cells and tissues in the body. Aberrant 

expression or mutations in these genes and their products is the cause of, or is associated with, a variety 
of human diseases such as cancer and other cell proliferative disorders. The identification of these 
genes and their products is the basis of an ever-expanding effort to find markers for early detection of 
diseases, and targets for their prevention and treatment 
15 For example, cancer represents a type of cell prolif erative disorder that affects nearly every 

tissue in the body. A wide variety of molecules, either aberrantly expressed or mutated, can be the 
cause of, or involved with, various cancers because tissue growth involves complex and ordered 
patterns of cell proliferation, cell differentiation, and apoptosis. Cell proliferation must be regulated to 
maintain both the number of cells and their spatial organization. This regulation depends upon the 

2 o appropriate expression of proteins which control cell cycle progression in response to extracellular 

signals such as growth factors and other mitogens, and intracellular cues such as DN A damage or 
nutrient starvation. Molecules which directly or indirectly modulate cell cycle progression fall into 
several categories, including growth factors and their receptors, second messenger and signal - 
transduction proteins, oncogene products, tumor-suppressor proteins, and mitosis-promoting factors. 

2 5 Aberrant expression or mutations in any of these gene products can result in cell proliferative disorders . 

such as cancer. Oncogenes are genes generally derived from normal genes that, through abnormal 
expression or mutation, can effect the transformation of a normal cell to a malignant one (oncogenesis). 
Oncoproteins, encoded by oncogenes, can affect cell proliferation in a variety of ways and include 
growth factors, growth factor receptors, intracellular signal transducers, nuclear transcription factors, 

3 o and cell-cycle control proteins. In contrast, tumor-suppressor genes are involved in inhibiting cell 

proliferatioa Mutations which cause reduced or loss of function in tumor-suppressor genes result in 
aberrant cell proliferation and cancer. Thus a wide variety of genes and their products have been found 
that are associated with cell proliferative disorders such as cancer, but many more may exist that are 
yet to be discovered. 

3 5 DNA-based arrays can provide a simple way to explore the expression of a single polymorphic 

1 
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gene or a large number of genes. When the expression of a single gene is explored, DNA-based arrays 
are employed to detect the expression of specific gene variants. For example, a p53 tumor suppressor 
gene array is used to determine whether individuals are carrying mutations that predispose them to 
cancer. A cytochrome p450 gene array is useful to determine whether individuals have one of a number 
5 of specific mutations that could result in increased drug metabolism, drug resistance or drug toxicity. 

DNA-based array technology is especially relevant for the rapid screening of expression of a 
large number of genes. There is a growing awareness that gene expression is affected in a global 
fashion. A genetic predisposition, disease or therapeutic treatment may affect, directly or indirectly, the 
expression of a large number of genes. In some cases the interactions may be expected, such as when 

10 the genes are part of the same signaling pathway. In other cases, such as when the genes participate in 
separate signaling pathways, the interactions may be totally unexpected. Therefore, DNA-based arrays 
can be used to investigate how genetic predisposition, disease, or therapeutic treatment affects the 
expression of a large number of genes. 

The discovery of new molecules for disease detection and treatment satisfies a need in the art 

15 by providing new compositions which are useful in the diagnosis, study, prevention, and treatment of 
diseases associated with, as well as effects of exogenous compounds on, the expression of molecules for 
disease detection and treatment 

SUMMARY OF THE INVENTION 

2 o The present invention relates to human disease detection and treatment molecule 

polynucleotides (mddt) as presented in the Sequence Listing. The mddt uniquely identify genes 
encoding structural, functional, and regulatory disease detection and treatment molecules. 

The invention provides an isolated polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of 

2 5 SEQ ID NO: 1-45 ; b) a naturally occurring polynucleotide sequence having at least 90% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO: 1-45; c) a 
polynucleotide sequence complementary to a); d) a polynucleotide sequence complementary to b); and 
e) an RNA equivalent of a) through d). In one alternative, the polynucleotide comprises a 
polynucleotide sequence selected from the group consisting of SEQ ID NO: 1-45. In another alternative, 

30 the polynucleotide comprises at least 60 contiguous nucleotides of a polynucleotide sequence selected 
from the group consisting of a) a polynucleotide sequence selected from the group consisting of SEQ ID 
NO: 1-45; b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:l-45; c) a polynucleotide 
sequence complementary to a); d) a 
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polynucleotide sequence complementary to b); and e) an RNA equivalent of a) through d). The 
invention further provides a composition for the detection of expression of disease detection and 
treatment molecule polynucleotides comprising at least one isolated polynucleotide comprising a 
polynucleotide sequence selected from the group consisting of a) a polynucleotide sequence selected 
from the group consisting of SEQ ID NO: 1-45; b) a naturally occurring polynucleotide sequence having 
at least 90% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:l-45; c) a polynucleotide sequence complementary to a); d) a polynucleotide sequence 
complementary to b); and e) an RNA equivalent of a) through d); and a detectable label. 

The invention also provides a method for detecting a target polynucleotide in a sample, said 
target polynucleotide comprising a polynucleotide sequence selected from the group consisting of a) a 
polynucleotide sequence selected from the group consisting of SEQ ID NO: 1-45; b) a naturally 
occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence 
selected from the group consisting of SEQ ID NO: 1-45; c) a polynucleotide sequence complementary to. 
a); d) a polynucleotide sequence complementary to b); and e) an RNA equivalent of a) through d). The 
method comprises a) amplifying said target polynucleotide or a fragment thereof using polymerase 
chain reaction amplification, and b) detecting the presence or absence of said amplified target 
polynucleotide or fragment thereof, and, optionally, if present, the amount thereof . 

The invention also provides a method for detecting a target polynucleotide in a sample, said 
target polynucleotide comprising a polynucleotide sequence selected from the group consisting of a) a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:l-45; b) a naturally 
occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence 
selected from the group consisting of SEQ ID NO:l-45 ; c) a polynucleotide sequence complementary to 
a); d) a polynucleotide sequence complementary to b); and e) an RNA equivalent of a) through d). The 
method comprises a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization complex 
is formed between said probe and said target polynucleotide, and b) detecting the presence or absence of 
said hybridization complex, and, optionally, if present, the amount thereof . In one alternative, the probe 
comprises at least 30 contiguous nucleotides. In another alternative, the probe comprises at least 60 
contiguous nucleotides. 

The invention further provides a recombinant polynucleotide comprising a promoter sequence 
operably linked to an isolated polynucleotide comprising a polynucleotide sequence selected from the 
group consisting of a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:l- 
45 ; b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a 

3 
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polynucleotide sequence selected from the group consisting of SEQ ID NO: 1-45; c) a polynucleotide 
sequence complementary to a); d) a polynucleotide sequence complementary to b); and e) an RNA 
equivalent of a) through d). In one alternative, the invention provides a ceil transformed with the 
recombinant polynucleotide. In another alternative, the invention provides a transgenic organism 
5 comprising the recombinant polynucleotide. In a further alternative, the invention provides a method 
for producing a disease detection and treatment molecule polypeptide, the method comprising a) 
culturing a cell under conditions suitable for expression of the disease detection and treatment molecule 
polypeptide, wherein said cell is transformed with the recombinant polynucleotide, and b) recovering the 
disease detection and treatment molecule polypeptide so expressed 

1 o The invention also provides a purified disease detection and treatment molecule polypeptide 

(MDDT) encoded by at least one polynucleotide comprising a polynucleotide sequence selected from 
the group consisting of SEQ ID NO: 1-45. Additionally, the invention provides an isolated antibody 
which specifically binds to the disease detection and treatment molecule polypeptide. The invention 
further provides a method of identifying a test compound which specifically binds to the disease 
1 5 detection and treatment molecule polypeptide, the method comprising the steps of a) providing a test 
compound; b) combining the disease detection and treatment molecule polypeptide with the test 
compound for a sufficient time and under suitable conditions for binding; and c) detecting binding of the 
disease detection and treatment molecule polypeptide to the test compound, thereby identifying the test 
compound which specifically binds the disease detection and treatment molecule polypeptide. 

2 o The invention further provides a microarray wherein at least one element of the microarray is 

an isolated polynucleotide comprising at least 60 contiguous nucleotides of a polynucleotide comprising 
a polynucleotide sequence selected from the group consisting of a) a polynucleotide sequence selected 
from the group consisting of SEQ ID NO: 1-45; b) a naturally occurring polynucleotide sequence having 
at least 90% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ 
25 ID NO.1-45; c) a polynucleotide sequence complementary to a); d) a polynucleotide sequence 

complementary to b); and e) an RNA equivalent of a) through d). The invention also provides a method 
for generating a transcript image of a sample which contains polynucleotides. The method comprises a) 
labeling the polynucleotides of the sample, b) contacting the dements of the microarray with the labeled 
polynucleotides of the sample under conditions suitable for the formation of a hybridization complex, 

3 0 and c) quantifying the expression of the polynucleotides in the sample. 

Additionally, the invention provides a method for screening a compound for effectiveness in 
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
polynucleotide sequence selected from the group consisting of a) a polynucleotide sequence selected 
from the group consisting of SEQ ID NO: 1-45; b) a naturally occurring polynucleotide sequence having 



SUBSTITUTE SHEET (RULE 26) 



WO 01/62922 



PCTYUS01/05896 



at least 90% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:l-45; c) a polynucleotide sequence complementary to a); d) a polynucleotide sequence 
complementary to b); and e) an RNA equivalent of a) through d). The method comprises a) exposing a 
sample comprising the target polynucleotide to a compound, and b) detecting altered expression of the 
5 target polynucleotide, and c) comparing the expression of the target polynucleotide in the presence of 
varying amounts of the compound and in the absence of the compound 

The invention further provides a method for assessing toxicity of a test compound, said method 
comprising a) treating a biological sample containing nucleic acids with the test compound; b) 
hybridizing the nucleic acids of the treated biological sanqrte with a probe comprising at least 20 

1 o contiguous nucleotides of a polynucleotide comprising a polynucleotide sequence selected from the 

group consisting of i) a polynucleotide sequence selected from the group consisting of SEQ ID NO:l- 
45; ii) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO: 1-45; iii) a polynucleotide 
sequence complementary to i), iv) a polynucleotide sequence complementary to ii), and v) an RNA 
is equivalent of i)-iv). Hybridization occurs under conditions whereby a specific hybridization complex 
is formed between said probe and a target polynucleotide in the biological sample, said target 
polynucleotide comprising a polynucleotide sequence selected from the group consisting of i) a 
polynucleotide sequence selected from the group consisting of SEQ ID NO: 1-45; ii) a naturally 
occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide 

2 o sequence selected from the group consisting of SEQ ID NO: 1-45 ; iii) a polynucleotide sequence 

complementary to i), iv) a polynucleotide sequence complementary to ii), and v) an RNA equivalent 
of i)-iv), and alternatively, the target polynucleotide comprises a fragment of a polynucleotide sequence 
selected from the group consisting of i)-v) above; c) quantifying the amount of hybridization complex; 
and d) comparing the amount of hybridization complex in the treated biological sample with the amount 

25 of hybridization complex in an untreated biological sample, wherein a difference in the amount of 
hybridization complex in the treated biological sample is indicative of toxicity of the test compound 
The invention further provides an isolated polypeptide comprising an amino acid sequence 
selected from the group consisting of a) an amino acid sequence selected from the group consisting of 
SEQ ID NO:46-90, b) a naturally occurring amino acid sequence having at least 90% sequence identity 

30 to an amino acid sequence selected from the group consisting of SEQ ID NO:46-90, c) a biologically 
active fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:46-90, 
and d) an immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ 
ID NO:46-90. In one alternative, the invention provides an isolated polypeptide comprising the amino 
acid sequence of SEQ ID NO:46-90. 
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DESCRIPTION OF THE TABLES 

Table 1 shows the sequence identification numbers (SEQ ID NO:s) and template identification 
numbers (template IDs) corresponding to the polynucleotides of the present invention, along with their 
5 GenBank hits (GI Numbers), probability scores, and functional annotations corresponding to the 
GenBank hits. 

Table 2 shows the sequence identification numbers (SEQ ID NO:s) and template identification 
numbers (template IDs) corresponding to the polynucleotides of the present invention, along with 
polynucleotide segments of each template sequence as defined by the indicated "start" and "stop" 

1 o nucleotide positions. The reading frames of the polynucleotide segments and the Pf am hits, Pfam 

descriptions, and E-values corresponding to the polypeptide domains encoded by the polynucleotide 
segments are indicated. 

Table 3 shows the sequence identification numbers (SEQ ID NO:s) and template identification 
numbers (template IDs) corresponding to the polynucleotides of the present invention, along with 
1 5 polynucleotide segments of each template sequence as defined by the indicated "start" and "stop" 
nucleotide positions. The reading frames of the polynucleotide segments are shown, and the 
polypeptides encoded by the polynucleotide segments constitute either signal peptide (SP) or 
transmembrane (TM) domains, as indicated The membrane topology of the encoded polypeptide 
sequence is indicated, the N-terminus (N) listed as being oriented to either the cytosolic (in) or non- 

2 o cytosolic (out) side of the cell membrane or organelle. 

Table 4 shows the sequence identification numbers (SEQ ID NO:s) corresponding to the 
polynucleotides of the present invention, along with component sequence identification numbers 
(component IDs) corresponding to each template. The component sequences, which were used to 
assemble the template sequences, are defined by the indicated "start" and "stop" nucleotide positions 

2 5 along each template. 

Table 5 shows the tissue distribution profiles for the templates of the inventioa 
Table 6 shows the sequence identification numbers (SEQ ID NO:s) corresponding to the 
polypeptides of the present invention, along with the reading frames used to obtain the polypeptide 
segments, the lengths of the polypeptide segments, the "start" and "stop" nucleotide positions of the 

3 o polynucleotide sequences used to define the encoded polypeptide segments, the GenBank hits (GI 

Numbers), probability scores, and lunctional annotations corresponding to the GenBank hits. 

Table 7 summarizes the bioinformatics tools which are useful for analysis of the 
polynucleotides of the present invention. The first column of Table 7 lists analytical tools, programs, 
and algorithms, the second column provides brief descriptions thereof, the third column presents 
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appropriate references, all of which are incorporated by reference herein in their entirety, and the fourth 
column presents, where applicable, the scores, probability values, and other parameters used to evaluate 
the strength of a match between two sequences (the higher the score, the greater the homology between 
two sequences). 

5 

DETAILED DESCRIPTION OF THE INVENTION 

Before the nucleic acid sequences and methods are presented, it is to be understood that this 
invention is not limited to the particular machines, methods, and materials described Although 
particular embodiments are described, machines, methods, and materials similar or equivalent to these 
1 o embodiments may be used to practice the invention. The preferred machines, methods, and materials 
set forth are not intended to limit the scope of the invention which is limited only by the appended 
claims. 

The singular forms "a", "an", and "the" include plural reference unless the context clearly 
dictates otherwise. All technical and scientific terms have the meanings commonly understood by one 
15 of ordinary skill in the art. All publications are incorporated by reference for the purpose of describing 
and disclosing the cell lines, vectors, and methodologies which are presented and which might be used in 
connection with the invention. Nothing in the specification is to be construed as an admission that the 
invention is not entitled to antedate such disclosure by virtue of prior invention. 

20 Definitions 

As used herein, the lower case "mddt" refers to a nucleic acid sequence, while the upper case 
"MDDT" refers to an amino acid sequence encoded by mddt. A "full-length" mddt refers to a nucleic 
acid sequence containing the entire coding region of a gene endogenously expressed in human tissue. 

"Adjuvants" are materials such as Freund's adjuvant, mineral gels (aluminum hydroxide), and 
25 surface active substances (lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole 
limpet hemocyanin, and dinitrophenol) which may be administered to increase a host's immunological 
response. 

"Allele" refers to an alternative form of a nucleic acid sequence. Alleles result from a 
"mutation," a change or an alternative reading of the genetic code. Any given gene may have none, one> 
30 or many allelic forms. Mutations which give rise to alleles include deletions, additions, or substitutions 
of nucleotides. Each of these changes may occur alone, or in combination with the others, one or more 
times in a given nucleic acid sequence. The present invention encompasses allelic mddt 

"Amino acid sequence" refers to a peptide, a polypeptide, or a protein of either natural or 
synthetic origin. The amino acid sequence is not limited to the complete, endogenous amino acid 
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sequence and may be a fragment, epitope, variant, or derivative of a protein expressed by a nucleic acid 
sequence. 

"Amplification" refers to the production of additional copies of a sequence and is carried out 
using polymerase chain reaction (PCR) technologies well known in the art 
5 "Antibody" refers to intact molecules as well as to fragments thereof, such as Fab, F(ab')2> and 

Fv fragments, which are capable of binding the epitopic determinant. Antibodies that bind MDDT 
polypeptides can be prepared using intact polypeptides or using fragments containing small peptides of 
interest as the immunizing antigen The polypeptide or peptide used to immunize an animal (e.g., a 
mouse, a rat, or a rabbit) can be derived from the translation of RNA, or synthesized chemically, and 

1 o can be conjugated to a carrier protein if desired Commonly used carriers that are chemically coupled 

to peptides include bovine serum albumin, thyroglobulin, and keyhole limpet hemocyanin (KLH). The 
coupled peptide is then used to immunize the animal. 

"Antisense sequence" refers to a sequence capable of specifically hybridizing to a target 
sequence. The antisense sequence may include DNA, RNA, or any nucleic acid mimic or analog such 
15 as peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as 

phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified 
sugar groups such as 2'-methoxyethyl sugars or 2-methoxyethoxy sugars; or oligonucleotides having 
modified bases such as 5 -methyl cytosine, 2'-deoxyuracil, or 7-deaza-2'-deoxyguanosine. 

"Antisense sequence" refers to a sequence capable of specifically hybridizing to a target 

2 o sequence. The antisense sequence can be DNA, RNA, or any nucleic acid mimic or analog. 

"Antisense technology" refers to any technology which relies on the specific hybridization of an 
antisense sequence to a target sequence. 

A "bin" is a portion of computer memory space used by a computer program for storage of 
data, and bounded in such a manna: that data stored in a bin may be retrieved by the program. 
25 "Biologically active" refers to an amino acid sequence having a structural, regulatory, or 

biochemical function of a naturally occurring amino acid sequence. 

"Clone joining" is a process for combining gene bins based upon the bins* containing sequence 
information from the same clone. The sequences may assemble into a primary gene transcript as well 
as one or more splice variants. 

3 o "Complementary" describes the relationship between two single-stranded nucleic acid 

sequences that anneal by base-pairing (5-A-G-T-3' pairs with its complement 3'-T-C-A-5'). 

A "component sequence" is a nucleic acid sequence selected by a computer program such as 
PHRED and used to assemble a consensus or template sequence from one or more component 
sequences. 
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A "consensus sequence" or 'template sequence" is a nucleic acid sequence which has been 
assembled from overlapping sequences, using a computer program for fragment assembly such as the 
GELVIEW fragment assembly system (Genetics Computer Group (GCG), Madison WI) or using a 
relational database management system (RDMS). 
5 "Conservative amino acid substitutions" are those substitutions that, when made, least interfere 

with the properties of the original protein, i.e., the structure and especially the function of the protein is 
conserved and not significantly changed by such substitutions. The table below shows amino acids 
which may be substituted for an original amino acid in a protein and which are regarded as conservative 
substitutions. 
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Original Residue 


Conservative Substitution 




Ala 


Gly, Ser 




Arg 


His, Lys 




Asn 


Asp, Gin, His 


15 


Asp 


Asn, Glu 




Cys 


Ala, Ser 




Gin 


Asn, Glu, His 




Glu 


Asp, Gin, His 




Gly 


Ala 


20 


His 


Asn, Arg, Gin, Glu 




lie 


Leu, Val 




Leu 


Be, Val 




Lys 


Arg, Gin, Glu 




Met 


Leu, lie 


25 


Phe 


His, Met, Leu, Trp, Tyr 




Ser 


Cys, Thr 




Thr 


Ser, Val 




Trp 


Phe, Tyr 




Tyr 


His, Phe, Trp 


30 


Val 


De, Leu, Thr 



Conservative substitutions generally maintain (a) the structure of the polypeptide backbone in 
the area of the substitution, for example, as a beta sheet or alpha helical conformation, (b) the charge or 

3 5 hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain. 

"Deletion" refers to a change in either a nucleic or amino acid sequence in which at least one 
nucleotide or amino acid residue, respectively, is absent 

"Derivative" refers to the chemical modification of a nucleic acid sequence, such as by 
replacement of hydrogen by an alkyl, acyl, amino, hydroxyl, or other group. 

4 o The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other 

chemical compound having a unique and defined position on a microarray. 
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"E-value" refers to the statistical probability that a match between two sequences occurred by 

chance. 

A "fragment" is a unique portion of mddt or MDDT which is identical in sequence to but 
shorter in length than the parent sequence. A fragment may comprise up to the entire length of the 
5 defined sequence, minus one nucleotide/amino acid residue. For example, a fragment may comprise 
from 10 to 1000 contiguous amino acid residues or nucleotides. A fragment used as a probe, prima:, 
antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 15, 16, 20, 25, 30, 40, 50, 
60, 75, 100, 150, 250 or at least 500 contiguous amino acid residues or nucleotides in length. 
Fragments may be preferentially selected from certain regions of a molecule. For example, a 

1 o polypeptide fragment may comprise a certain length of contiguous amino acids selected from the first 

250 or 500 amino acids (or first 25% or 50%) of a polypeptide as shown in a certain defined sequence. 
Clearly these lengths are exemplary, and any length that is supported by the specification, including the 
Sequence Listing and the figures, may be encompassed by the present embodiments. 

A fragment of mddt comprises a region of unique polynucleotide sequence that specifically 

15 identifies mddt, for example, as distinct from any other sequence in the same genome. A fragment of 
mddt is useful, for example, in hybridization and amplification technologies and in analogous methods 
that distinguish mddt from related polynucleotide sequences. The precise length of a fragment of mddt 
and the region of mddt to which the fragment corresponds are routinely determinable by one of ordinary 
skill in the art based on the intended purpose for the fragment. 

20 A fragment of MDDT is encoded by a fragment of mddt. A fragment of MDDT comprises a 

region of unique amino acid sequence that specifically identifies MDDT. For example, a fragment of 
MDDT is useful as an immunogenic peptide for the development of antibodies that specifically 
recognize MDDT. The precise length of a fragment of MDDT and the region of MDDT to which the 
fragment corresponds are routinely determinable by one of ordinary skill in the art based on the intended 

2 5 purpose for the fragment 

A "full length" nucleotide sequence is one containing at least a start site for translation to a 
protein sequence, followed by an open reading frame and a stop site, and encoding a "full length" 
polypeptide. 

"Hit" refers to a sequence whose annotation will be used to describe a given template. Criteria 
30 for selecting the top hit are as follows: if the template has one or more exact nucleic acid matches, the 
top hit is the exact match with highest percent identity. If the template has no exact matches but has 
significant protein hits, the top hit is the protein hit with the lowest E-value. If the template has no 
significant protein hits, but does have significant non-exact nucleotide hits, the top hit is the nucleotide 
hit with the lowest E-value. 
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"Homology" refers to sequence similarity either between a reference nucleic acid sequence and 
at least a fragment of an mddt or between a reference amino acid sequence and a fragment of an 
MDDT. 

"Hybridization" refers to the process by which a strand of nucleotides anneals with a 
5 complementary strand through base pairing. Specific hybridization is an indication that two nucleic 
acid sequences share a high degree of identity. Specific hybridization complexes form under defined 
annealing conditions, and remain hybridized after the "washing" step. The defined hybridization 
conditions include the annealing conditions and the washing step(s), the latter of which is particularly 
important in determining the stringency of the hybridization process, with more stringent conditions 

10 allowing less non-specific binding, i.e., binding between pairs of nucleic acid probes that are not 
perfectly matched. Permissive conditions for annealing of nucleic acid sequences are routinely 
determinable and may be consistent among hybridization experiments, whereas wash conditions may be 
varied among experiments to achieve the desired stringency. 

Generally, stringency of hybridization is expressed with reference to the temperature under 

15 which the wash step is carried out. Generally, such wash temperatures are selected to be about 5°C to 
20°C lower than the thermal melting point (TJ for the specific sequence at a defined ionic strength and 
pH. The T ra is the temperature (under defined ionic strength and pH) at which 50% of the target 
sequence hybridizes to a perfectly matched probe. An equation for calculating T m and conditions for 
nucleic acid hybridization is well known and can be found in Sambrook et al., 1989, Molecular 

20 Cloning: A Laboratory Manual , 2 nd ed, vol. 1-3, Cold Spring Harbor Press, Plainview NY; specifically 
see volume 2, chapter 9. 

High stringency conditions for hybridization between polynucleotides of the present invention 
include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0. 1% SDS, for 1 hour. 
Alternatively, temperatures of about 65°C, 60°C, or 55°C may be used SSC concentration may be 

25 varied from about 0.2 to 2 x SSC, with SDS being present at about 0.1%. Typically, blocking reagents 
are used to block non-specific hybridization. Such blocking reagents include, for instance, denatured 
salmon sperm DNA at about 100-200 (ig/ml. Useful variations on these conditions will be readily 
apparent to those skilled in the art. Hybridization, particularly undo: high stringency conditions, may 
be suggestive of evolutionary similarity between the nucleotides. Such similarity is strongly indicative 

30 of a similar role for the nucleotides and their resultant proteins. 

Other parameters, such as temperature, salt concentration, and detergent concentration may be 
varied to achieve the desired stringency. Denaturants, such as formamide at a concentration of about 
35-50% v/v, may also be used undo- particular circumstances, such as RNA:DNA hybridizations. 
Appropriate hybridization conditions are routinely (teterminable by one of ordinary skill in the art. 
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'Immunogenic" describes the potential for a natural, recombinant, or synthetic peptide, epitope, 
polypeptide, or protein to induce antibody production in appropriate animals, cells, or cell lines. 

"Insertion" or "addition" refers to a change in either a nucleic or amino acid sequence in which 
at least one nucleotide or residue, respectively, is added to the sequence. 
5 "Labeling" refers to the covalent or noncovalent joining of a polynucleotide, polypeptide, or 

antibody with a reporter molecule capable of producing a detectable or measurable signal. 

"Microarray" is any arrangement of nucleic acids, amino acids, antibodies, etc., on a substrate. 
The substrate may be a solid support such as beads, glass, paper, nitrocellulose, nylon, or an 
appropriate membrane. 

10 'linkers" are short stretches of nucleotide sequence which may be added to a vector or an mddt 

to create restriction endonuclease sites to facilitate cloning. "Polylinkers" are engineered to incorporate 
multiple restriction enzyme sites and to provide for the use of enzymes which leave 5' or 3* overhangs 
(e.g., BamHI, EcoRI, and Hindlll) and those which provide blunt ends (e.g., EcoRV, SnaBI, and StuI). 
"Naturally occurring" refers to an endogenous polynucleotide or polypeptide that may be 

1 5 isolated from viruses or prokaryotic or eukaryotic cells. 

"Nucleic acid sequence" refers to the specific order of nucleotides joined by phosphodiester 
bonds in a linear, polymeric arrangement Depending on the number of nucleotides, the nucleic acid 
sequence can be considered an oligomer, oligonucleotide, or polynucleotide. The nucleic acid can be 
DNA, RNA, or any nucleic acid analog, such as PNA, may be of genomic or synthetic origin, may be 

2 o either double-stranded or single-stranded, and can represent either the sense or antisense 

(complementary) strand. 

"Oligomer" refers to a nucleic acid sequence of at least about 6 nucleotides and as many as 
about 60 nucleotides, preferably about 15 to 40 nucleotides, and most preferably between about 20 and 
30 nucleotides, that may be used in hybridization or amplification technologies. Oligomers may be used 
25 as, e.g., primers for PCR, and are usually chemically synthesized. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with the second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Generally, operably linked DNA sequences may be in close proximity or contiguous and, 

3 o where necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to a DNA mimic in which nucleotide bases are attached to 
a pseudopeptide backbone to increase stability. PNAs, also designated antigene agents, can prevent 
gene expression by targeting complementary messenger RNA 
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The phrases ''percent identity" and "% identity", as applied to polynucleotide sequences, refer 
to the percentage of residue matches between at least two polynucleotide sequences aligned using a 
standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps in 
the sequences being compared in order to optimize alignment between two sequences, and therefore 
5 achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3. 12e sequence 
alignment program. This program is part of the LASERGENE software package, a suite of molecular 
biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in Higgins, D.G. 

10 and Sharp, P.M. (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 8:189-191. 
For pairwise alignments of polynucleotide sequences, the default parameters are set as follows: 
Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue weight table is 
selected as the default. Percent identity is reported by CLUSTAL V as the "percent similarity" between 
aligned polynucleotide sequence pairs. 

15 Alternatively, a suite of commonly used and freely available sequence comparison algorithms is 

provided by the National Cento for Biotechnology Information (NCBI) Basic Local Alignment Search 
Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is available from several 
sources, including the NCBI, Bethesda, MD, and on the Internet at 

http://www.ncbi.hhn.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis 
2 o programs including "blastn," that is used to determine alignment between a known polynucleotide 
sequence and other sequences on a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf^bl2/. The 
"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 

2 5 programs are commonly used with gap and other parameters set to default settings. For example, to 

compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0.9 (May-07-1999) set at default parameters. Such default parameters may be, for example: 

Matrix: BLOSUM62 

Reward for match: 1 

3 o Penalty for mismatch: -2 

Open Gap: 5 and Extension Gap: 2 penalties 
Gap x drop-off: 50 
Expect: 10 
Word Size: 11 
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Filter: on 

Percent identity may be measured over the length of an entire defined sequence, for example, as 
defined by a particular SEQ ID number, or may be measured over a shorter length, for example, over 
the length of a fragment taken from a largo:, defined sequence, for instance, a fragment of at least 20, at 
5 least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous nucleotides. Such 
lengths are exemplary only, and it is understood that any fragment length supported by the sequences 
shown herein, in figures or Sequence Listings, may be used to describe a length over which percentage 
identity may be measured 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 

1 o similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes in 

nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid sequences 
that all encode substantially the same protein 

The phrases "percent identity" and "% identity", as applied to polypeptide sequences, refer to 
the percentage of residue matches between at least two polypeptide sequences aligned using a 

1 5 standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some alignment 
methods take into account conservative amino acid substitutions. Such conservative substitutions, 
explained in more detail above, generally preserve the hydrophobicity and acidity of the substituted 
residue, thus preserving the structure (and therefore function) of the folded polypeptide. 

Percent identity between polypeptide sequences may be determined using the default parameters 

20 of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3. 12e sequence alignment 
program (described and referenced above). For pairwise alignments of polypeptide sequences using 
CLUSTAL V, the default parameters are set as follows: Ktuple=l, gap penalty=3, window=5, and 
"diagonals saved"=5. The PAM250 matrix is selected as the default residue weight table. As with 
polynucleotide alignments, the percent identity is reported by CLUSTAL V as the "percent similarity" 

2 5 between aligned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 2.0.9 
(May-07-1999) with blastp set at default parameters. Such default parameters may be, for example: 

Matrix: BLOSUM62 
30 Open Gap: 11 and Extension Gap: 1 penalty 

Gap x drop-off: 50 

Expect: 10 

Word Size: 3 

Filter: on 
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Percent identity may be measured over the length of an entire defined polypeptide sequence, for 
example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for instance* 
a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 150 
5 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment length 
supported by the sequences shown herein, in figures or Sequence Listings, may be used to describe a 
length over which percentage identity may be measured 

"Post-translational modification" of an MDDT may involve lipidation, glycosylation, 
phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in the 
10 art. These processes may occur synthetically or biochemically. Biochemical modifications will vary by 
cell type depending on the enzymatic milieu and the MDDT. 

'Trobe" refers to mddt or fragments thereof, which are used to detect identical, allelic or related 
nucleic acid sequences. Probes are isolated oligonucleotides or polynucleotides attached to a detectable 
label or reporter molecule. Typical labels include radioactive isotopes, ligands, chemfluminescent 
1 5 agents, and enzymes. 'Trimers" are short nucleic acids, usually DNA oligonucleotides, which may be 
annealed to a target polynucleotide by complementary base-pairing. The primer may then be extended 
along the target DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification 
(and identification) of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous 
2 o nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 
be employed, such as probes and primers that comprise at least 20, 30, 40, 50, 60, 70, 80, 90, 100, or 
at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers may 
be considerably longer than these examples, and it is understood that any length supported by the 
specification, including the figures and Sequence Listing, may be used. 

2 5 Methods for preparing and using probes and primers are described in the references, for 

example Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual . 2 nd ed., vol. 1-3, Cold 
Spring Harbor Press, Plainview NY; Ausubel et al.,1987, Current Protocols in Molecular Biology , 
Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis et al., 1990, PCR Protocols, A Guide 
to Methods and Applications , Academic Press, San Diego CA. PCR primer pairs can be derived from 

3 0 a known sequence, for example, by using compute programs intended for that purpose such as Primer 

(Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge MA), 

Oligonucleotides for use as primers are selected using software known in the art for such 
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000 
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nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer selection 
programs have incorporated additional features for expanded capabilities. For example, the PrimOU 
primer selection program (available to the public from the Genome Center at University of Texas South 
West Medical Center, Dallas TX) is capable of choosing specific primers from megabase sequences 
5 and is thus useful for designing primers on a genome- wide scope. The Primer3 primer selection 
program (available to the public from the Whitehead Institute/MIT Cento* for Genome Research, 
Cambridge MA) allows the user to input a "misprinting library/' in which sequences to avoid as primer 
binding sites are user-specified Primer3 is useful, in particular, for the selection of oligonucleotides for 
microarrays. (The source code for the latter two primer selection programs may also be obtained from 

l o their respective sources and modified to meet the user' s specific needs.) The PrimeGen program 
(available to the public from the UK Human Genome Mapping Project Resource Centre, Cambridge 
UK) designs primers based on multiple sequence alignments, thereby allowing selection of primers that 
hybridize to either the most conserved or least conserved regions of aligned nucleic acid sequences. 
Hence, this program is useful for identification of both unique and conserved oligonucleotides and 

15 polynucleotide fragments. The oligonucleotides and polynucleotide fragments identified by any of the 
above selection methods are useful in hybridization technologies, for example, as PCR or sequencing 
primers, microarray elements, or specific probes to identify fully or partially complementary 
polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are not limited to 
those described above. 

2 o "Purified" refers to molecules, either polynucleotides or polypeptides that are isolated or 

separated from their natural environment and are at least 60% free, preferably at least 75 % free, and 
most preferably at least 90% free from other compounds with which they are naturally associated 

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 
that is made by an artificial combination of two or more otherwise separated segments of sequence. 

2 5 This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 

artificial manipulation of isolated segments of nucleic acids, e.g„ by genetic engineering techniques 
such as those described in Sambrook, supra . The term recombinant includes nucleic acids that have 
been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a 
recombinant nucleic acid may include a nucleic acid sequence operably linked to a promote sequence. 

3 o Such a recombinant nucleic acid may be part of a vector that is used, for example, to transform a cell. 

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 
expressed, inducing a protective immunological response in the mammal. 

"Regulatory element" refers to a nucleic acid sequence from nontranslated regions of a gene, 
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and includes enhancers, promoters, introns, and 3' untranslated regions, which interact with host 
proteins to carry out or regulate transcription or translation. 

"Reporter" molecules are chemical or biochemical moieties used for labeling a nucleic acid, an 
amino acid, or an antibody. They include radionuclides; enzymes; fluorescent, chemiluminescent, or 
5 chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and other moieties known in 
the art. 

An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear 
sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 

1 o instead of deoxyribose. 

"Sample" is used in its broadest sense. Samples may contain nucleic or amino acids, 
antibodies, or other materials, and may be derived from any source (e.g„ bodily fluids including, but not 
limited to, saliva, blood, and urine; chromosome(s), organelles, or membranes isolated from a cell; 
genomic DNA, RNA, or cDNA in solution or bound to a substrate; and cleared cells or tissues or blots 
15 or imprints from such cells or tissues). 

"Specific binding" or "specifically binding" refers to the interaction between a protein or 
peptide and its agonist, antibody, antagonist, or other binding partner. The interaction is dependent 
upon the presence of a particular structure of the protein, e.g., the antigenic determinant or epitope, 
recognized by the binding molecule. For example, if an antibody is specific for epitope "A," the 

2 o presence of a polypeptide containing epitope A, or the presence of free unlabeled A, in a reaction 

containing free labeled A and the antibody will reduce the amount of labeled A that binds to the 
antibody. 

"Substitution" refers to the replacement of at least one nucleotide or amino acid by a different 
nucleotide or amino acid. 

2 5 "Substrate" refers to any suitable rigid or semi-rigid support including, e.g., membranes, filters, 

chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles or capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

A "transcript image" refers to the collective pattern of gene expression by a particular tissue or 

30 cell type under given conditions at a given time. 

"Transformation" refers to a process by which exogenous DNA enters a recipient cell. 
Transformation may occur undo: natural or artificial conditions using various methods well known in 
the art Transformation may rely on any known method for the insertion of foreign nucleic acid 
sequences into a prokaryotic or eukaryotic host cell. Hie method is selected based on the host cell being 
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transformed 

'Transformants" include stably transformed cells in which the inserted DNA is capable of 
replication either as an autonomously replicating plasmid or as part of the host chromosome, as well as 
cells which transiently express inserted DNA or RNA 
5 A "transgenic organism," as used herein, is any organism, including but not limited to animals 

and plants, in which one or more of the cells of the organism contains heterologous nucleic acid 
introduced by way of human intervention, such as by transgenic techniques well known in the art The 
nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor of the cell, 
by way of deliberate genetic manipulation, such as by microinjection or by infection with a recombinant 

1 o virus. The term genetic manipulation does not include classical cross-breeding, or in vitro fertilization, 

but rather is directed to the introduction of a recombinant DNA molecule. The transgenic organisms 
contemplated in accordance with the present invention include bacteria, cyanobacteria, fiingi, and plants 
and animals. The isolated DNA of the present invention can be introduced into the host by methods 
known in the art, for example infection, transfection, transformation or transconjugation. Techniques 
1 5 for transferring the DNA of the present invention into such organisms are widely known and provided in 
references such as Sambrook et al. (1989), supra . 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having at 
least 25 % sequence identity to the particular nucleic acid sequence over a certain length of one of the 
nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 

2 o set at default parameters. Such a pair of nucleic acids may show, for example, at least 30%, at least 

50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95% or even at least 98% or 
greater sequence identity over a certain defined length. The variant may result in "conservative" amino 
acid changes which do not affect structural and/or chemical properties. , A variant may be described as, 
for example, an "allelic" (as defined above), "splice," "species," or "polymorphic" variant. A splice 

2 5 variant may have significant identity to a reference molecule, but will generally have a greater or lessor 

number of polynucleotides due to alternate splicing of exons during mRNA processing. The 
corresponding polypeptide may possess additional functional domains or lack domains that are present 
in the reference molecule. Species variants are polynucleotide sequences that vary from one species to 
another. The resulting polypeptides generally will have significant 

3 o amino acid identity relative to each other. A polymorphic variant is a variation in the polynucleotide 

sequence of a particular gene between individuals of a given species. Polymorphic variants also may 
encompass "single nucleotide polymorphisms" (SNPs) in which the polynucleotide sequence varies by 
one base. The presence of SNPs may be indicative of, for example, a certain population, a disease 
state, or a propensity for a disease state. 

18 



SUBSTITUTE SHEET (RULE 26) 



WO 01/62922 



PCTYUS01/05896 



In an alternative, variants of the polynucleotides of the present invention may be generated 
through recombinant methods. One possible method is a DNA shuffling technique such as 
MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 
5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat. 
5 Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat Biotechnol. 14:315-319) to alter or improve 
the biological properties of MDDT, such as its biological or enzymatic activity or its ability to bind to 
other molecules or compounds. DNA shuffling is a process by which a library of gene variants is 
produced using PCR-mediated recombination of gene fragments. The library is then subjected to 
selection or screening procedures that identify those gene variants with the desired properties. These 

1 o preferred variants may then be pooled and further subjected to recursive rounds of DNA shuffling and 

selection/screening. Thus, genetic diversity is created through "artificial" breeding and rapid molecular 
evolution. For example, fragments of a single gene containing random point mutations may be 
recombined, screened, and then reshuffled until the desired properties are optimized. Alternatively, 
fragments of a given gene may be recombined with fragments of homologous genes in the same gene 
1 5 family, either from the same or different species, thereby maximizing the genetic diversity of multiple 
naturally occurring genes in a directed and controllable manner. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having 
at least 40% sequence identity to the particular polypeptide sequence over a certain length of one of 
the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 

2 o 1999) set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at 

least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% or greater sequence 
identity ova: a certain defined length of one of the polypeptides. 

THE INVENTION 

25 In a particular embodiment, cDNA sequences derived from human tissues and cell lines ware 

aligned based on nucleotide sequence identity and assembled into "consensus" or "template" sequences 
which are designated by the template identification numbers (template IDs) in column 2 of Table 1 . 
The sequence identification numbers (SEQ ID NO:s) corresponding to the template IDs are shown in 
column 1. The template sequences have similarity to GenBank sequences, or "hits," as designated by 

30 the GI Numbers in column 3. The statistical probability of each GenBank hit is indicated by a 

probability score in column 4, and the functional annotation corresponding to each GenBank hit is listed 
in column 5. 

The invention incorporates the nucleic acid sequences of these templates as disclosed in the 
Sequence Listing and the use of these sequences in the diagnosis and treatment of disease states 
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characterized by defects in disease detection and treatment molecules. The invention further utilizes 
these sequences in hybridization and amplification technologies, and in particular, in technologies which 
assess gene expression patterns correlated with specific cells or tissues and their responses in vivo or in 
vitro to pharmaceutical agents, toxins, and other treatments. In this manner, the sequences of the 
5 present invention are used to develop a transcript image for a particular cell or tissue. 

Derivation of Nucleic Acid Sequences 

cDNA was isolated from libraries constructed using RNA derived from normal and diseased 
human tissues and cell lines. The human tissues and cell lines used for cDNA library construction were 

10 selected from a broad range of sources to provide a diverse population of cDNAs representative of gene 
transcription throughout the human body. Descriptions of the human tissues and cell lines used for 
cDNA library construction are provided in the LIFESEQ database (Incyte Genomics, Inc. (Incyte), Palo 
Alto CA). Human tissues were broadly selected from, for example, cardiovascular, dermatologic, 
endocrine, gastrointestinal, hematopoietic/immune system, musculoskeletal, neural, reproductive, and 

15 urologic sources. 

Cell lines used for cDNA library construction were derived from, for example, leukemic cells, 
teratocarcinomas, neuroepitheliomas, cervical carcinoma, lung fibroblasts, and endothelial cells. Such 
cell lines include, for example, THP-1, Jurkat, HUVEC, hNT2, WI38, HeLa, and other cell lines 
commonly used and available from public depositories (American Type Culture Collection, Manassas 
2 o VA). Prior to mRNA isolation, cell lines were untreated, treated with a pharmaceutical agent such as 
5 -aza-2 -deoxycytidine, treated with an activating agent such as lipopolysaccharide in the case of 
leukocytic cell lines, or, in the case of endothelial cell lines, subjected to shear stress. 

Sequencing of the cDNAs 

2 5 Methods for DNA sequencing are well known in the art. Conventional enzymatic methods 

employ the Klenow fragment of DNA polymerase I, SEQUENASE DNA polymerase (U.S. 
Biochemical Corporation, Cleveland OH), Taq polymerase (Applied Biosystems, Foster City CA), 
thermostable T7 polymerase (Amersham Pharmacia Biotech, Inc. (Amersham Pharmacia Biotech), 
Piscataway NJ), or combinations of polymerases and proofreading exonucleases such as those found in 

30 the ELONGASE amplification system (Life Technologies Inc. (Life Technologies), Gaithersburg MD), 
to extend the nucleic acid sequence from an oligonucleotide prima: annealed to the DNA template of 
interest. Methods have been developed for the use of both single-stranded and double-stranded 
templates. Chain termination reaction products may be electrophoresed on urea-polyacrylamide gels 
and detected either by autoradiography (for radioisotope-labeled nucleotides) or by fluorescence (for 
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fluorophore-labeled nucleotides). Automated methods for mechanized reaction preparation, sequencing, 
and analysis using fluorescence detection methods have been developed Machines used to prepare 
cDNAs for sequencing can include the MICROLAB 2200 liquid transfer system (Hamilton Company 
(Hamilton), Reno NV), Peltier thermal cycler (PTC200; MJ Research, Inc. (MJ Research), Watertown 
5 MA), and ABI CATALYST 800 thermal cycler (Applied Biosystems). Sequencing can be carried out 
using, for example, the ABI 373 or 377 (Applied Biosystems) or MEGABACE 1000 (Molecular 
Dynamics, Inc. (Molecular Dynamics), Sunnyvale CA) DNA sequencing systems, or other automated 
and manual sequencing systems well known in the art. 

The nucleotide sequences of the Sequence Listing have been prepared by current, state-of-the- 

10 art, automated methods and, as such, may contain occasional sequencing errors or unidentified 

nucleotides. Such unidentified nucleotides are designated by an N. These infrequent unidentified bases 
do not represent a hindrance to practicing the invention for those skilled in the art. Several methods 
employing standard recombinant techniques may be used to correct errors and complete the missing 
sequence informatioa (See, e.g., those described in Ausubel, F.M. et al. (1997) Short Protocols in 

15 Molecular Biology . John Wiley & Sons, New York NY; and Sambrook, J. et al. (1989) Molecular 
Cloning, A Laboratory Manual , Cold Spring Harbor Press, Plainview NY.) 

Assembly of cDNA Sequences 

Human polynucleotide sequences may be assembled using programs or algorithms well known 

20 in the art. Sequences to be assembled are related, wholly or in part, and may be derived from a single 
or many different transcripts. Assembly of the sequences can be performed using such programs as 
PHRAP (Phils Revised Assembly Program) and the GEL VIEW fragment assembly system (GCG), or 
other methods known in the art. 

Alternatively, cDNA sequences are used as "component" sequences that are assembled into 

25 "template" or "consensus" sequences as follows. Sequence chromatograms are processed, verified, and 
quality scores are obtained using PHRED. Raw sequences are edited using an editing pathway known 
as Block 1 (See, e.g., the LIFESEQ Assembled User Guide, Incyte Genomics, Palo Alto, CA). A series 
of BLAST comparisons is performed and low-information segments and repetitive elements (e.g., 
dinucleotide repeats, Alu repeats, etc.) are replaced by "n's", or masked, to prevent spurious matches. 

3 o Mitochondrial and ribosomal RNA sequences are also removed. The processed sequences are then 
loaded into a relational database management system (RDMS) which assigns edited sequences to 
existing templates, if available. When additional sequences are added into the RDMS, a process is 
initiated which modifies existing templates or creates new templates from works in 
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progress (i.e„ nonfinal assembled sequences) containing queued sequences or the sequences themselves. 
After the new sequences have been assigned to templates, the templates can be merged into bins. If 
multiple templates exist in one bin, the bin can be split and the templates reannotated. 

Once gene bins have been generated based upon sequence alignments, bins are "clone joined" 
5 based upon clone information. Clone joining occurs when the 5' sequence of one clone is present in one 
bin and the 3' sequence from the same clone is present in a different bin, indicating that the two bins 
should be merged into a single bia Only bins which share at least two different clones are merged. 

A resultant template sequence may contain either a partial or a full length open reading frame, 
or all or part of a genetic regulatory element. This variation is due in part to the fact that the full length 

10 cDNAs of many genes are several hundred, and sometimes several thousand, bases in length With 
current technology, cDNAs comprising the coding regions of large genes cannot be cloned because of 
vector limitations, incomplete reverse transcription of the mRNA, or incomplete "second strand" 
synthesis. Template sequences may be extended to include additional contiguous sequences derived 
from the parent RNA transcript using a variety of methods known to those of skill in the art. Extension 

is may thus be used to achieve the full length coding sequence of a gene. 

Analysis of the cDNA Sequences 

The cDNA sequences are analyzed using a variety of programs and algorithms which are well 
known in the art. (See, e.g., Ausubel, 1997, supra , Chapter 7.7; Meyers, R.A. (Ed.) (1995) Molecular 

2 o Biology and Biotechnology . Wiley VCH, New York NY, pp. 856-853; and Table 7.) These analyses 
comprise both reading frame determinations, e.g., based on triplet codon periodicity for particular 
organisms (Fickett, J.W. (1982) Nucleic Acids Res. 10:5303-5318); analyses of potential start and stop 
codons; and homology searches. 

Computer programs known to those of skill in the art for performing computer-assisted 

25 searches for amino acid and nucleic acid sequence similarity, include, for example, Basic Local 

Alignment Search Tool (BLAST; Altschul, S.F. (1993) J. Mol. Evol. 36:290-300; Altschul, S.F. et al. 
(1990) J. Mol. Biol. 215:403-410). BLAST is especially useful in determining exact matches and 
comparing two sequence fragments of arbitrary but equal lengths, whose alignment is locally maximal 
and for which the alignment score meets or exceeds a threshold or cutoff score set by the user (Karlin, 

30 S. et al. (1988) Proc. Natl. Acad Sci. USA 85:841-845). Using an appropriate search tool (e.g., 

BLAST or HMM), GenBank, SwissProt, BLOCKS, PFAM and other databases may be searched for 
sequences containing regions of homology to a query mddt or MDDT of the present invention. 

Other approaches to the identification, assembly, storage, and display of nucleotide and 
polypeptide sequences are provided in "Relational Database for Storing Biomolecule Information," 
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U.S.S.N. 08/947,845, filed October 9, 1997; "Project-Based Full-Length Biomolecular Sequence 
Database," U.S.S.N. 08/811,758, filed March 6, 1997; and "Relational Database and System for 
Storing Information Relating to Biomolecular Sequences," U.S.S.N. 09/034,807, filed March 4, 1998, 
all of which are incorporated by reference herein in their entirety. 
5 Protein hierarchies can be assigned to the putative encoded polypeptide based on, e.g., motif, 

BLAST, or biological analysis. Methods for assigning these hierarchies are described, for example, in 
"Database System Employing Protein Function Hierarchies for Viewing Biomolecular Sequence Data," 
U.S.S.N. 08/812,290, filed March 6, 1997, incorporated herein by reference. 

10 Human Disease Detection and Treatment Molecule Sequences 

The mddt of the present invention may be used for a variety of diagnostic and therapeutic 
purposes. For example, an mddt may be used to diagnose a particular condition, disease, or disorder 
associated with disease detection and treatment molecules. Such conditions, diseases, and disorders 
include, but are not limited to, a cell proliferative disorder, such as actinic keratosis, arteriosclerosis, 

15 atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, 
paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and 
cancers including adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, 
teratocarcinoma, and, in particular, a cancer of the adrenal gland, bladder, bone, bone marrow, brain, 
breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, 

20 pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and 
uterus; and an autoinmaune/inflammatory disorder, such as actinic keratosis, acquired 
immunodeficiency syndrome (AIDS), Addison* s disease, adult respiratory distress syndrome, 
allergies, ankylosing spondylitis, amyloidosis, anemia, arteriosclerosis, asthma, atherosclerosis, 
autoimmune hemolytic anemia, autoimmune thyroiditis, bronchitis, bursitis, cholecystitis, cirrhosis, 

25 contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, 

emphysema, erythroblastosis fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, 
Goodpasture's syndrome, gout, Graves' disease, Hashimoto's thyroiditis, paroxysmal nocturnal 
hemoglobinuria, hepatitis, hypereosinophilia, irritable bowel syndrome, episodic lymphopenia with 
lymphocytotoxins, mixed connective tissue disease (MCTD), multiple sclerosis, myasthenia gravis, 

3 o myocardial or pericardial inflammation, myelofibrosis, osteoarthritis, osteoporosis, pancreatitis, 
polycythemia vera, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, 
Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, primary 
thrombocythemia, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, 
complications of cancer, hemodialysis, and extracorporeal circulation, trauma, and hematopoietic 

3 5 cancer including lymphoma, leukemia, and myeloma. The mddt can be used to detect the presence of, 
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or to quantify the amount of, an mddt-related polynucleotide in a sample. This information is then 
compared to information obtained from appropriate reference samples, and a diagnosis is established. 
Alternatively, a polynucleotide complementary to a given mddt can inhibit or inactivate a 
therapeutically relevant gene related to the mddt 

5 

Analysis of mddt Expression Patterns 

The expression of mddt may be routinely assessed by hybridization-based methods to 
determine, for example, the tissue-specificity, disease-specificity, or developmental stage-specificity of 
mddt expression. For example, the level of expression of mddt may be compared among different cell 

10 types or tissues, among diseased and normal cell types or tissues, among cell types or tissues at 

different developmental stages, or among cell types or tissues undergoing various treatments. This type 
of analysis is useful, for example, to assess the relative levels of mddt expression in fully or partially 
differentiated cells or tissues, to determine if changes in mddt expression levels are correlated with the 
development or progression of specific disease states, and to assess the response of a cell or tissue to a 

1 5 specific therapy, for example, in pharmacological or toxicological studies. Methods for the analysis of 
mddt expression are based on hybridization and amplification technologies and include membrane-based 
procedures such as northern blot analysis, high-throughput procedures that utilize, for example, 
microarrays, and PCR-based procedures. 

20 Hybridization and Genetic Analysis 

The mddt, their fragments, or complementary sequences, may be used to identify the presence 
of and/or to determine the degree of similarity between two (or more) nucleic acid sequences. The mddt 
may be hybridized to naturally occurring or recombinant nucleic acid sequences under appropriately 
selected temperatures and salt concentrations. Hybridization with a probe based on the nucleic acid 

25 sequence of at least one of the mddt allows for the detection of nucleic acid sequences, including 

genomic sequences, which are identical or related to the mddt of the Sequence Listing. Probes may be 
selected from non-conserved or unique regions of at least one of the polynucleotides of SEQ ID NO:l- 
45 and tested for their ability to identify or amplify the target nucleic acid sequence using standard 
protocols. 

3 o Polynucleotide sequences that are capable of hybridizing, in particular, to those shown in SEQ 

ID NO: 1-45 and fragments thereof, can be identified using various conditions of stringency. (See, e.g., 
WaM, G.M. and S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, AR. (1987) Methods 
Enzymol. 152:507-5 11,) Hybridization conditions are discussed in "Definitions." 

A probe for use in Southern or northern hybridization may be derived from a fragment of an 
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mddt sequence, or its complement, that is up to several hundred nucleotides in length and is either 
single-stranded or double-stranded. Such probes may be hybridized in solution to biological materials 
such as plasmids, bacterial, yeast, or human artificial chromosomes, cleared or sectioned tissues, or to 
artificial substrates containing mddt Microarrays are particularly suitable for identifying the presence 
5 of and detecting the level of expression for multiple genes of interest by examining gene expression 
correlated with, e.g., various stages of development, treatment with a drug or compound, or disease 
progression. An array analogous to a dot or slot blot may be used to arrange and link polynucleotides 
to the surface of a substrate using one or more of the following: mechanical (vacuum), chemical, 
thermal, or UV bonding procedures. Such an array may contain any number of mddt and may be 

1 o produced by hand or by using available devices, materials, and machines. 

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g., 
Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci. 
USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/251116; Shalon, D. et al. 
(1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2150- 
15 2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) 

Probes may be labeled by either PCR or enzymatic techniques using a variety of commercially 
available reporter molecules. For example, commercial kits are available for radioactive and 
chemiluminescent labeling (Amersham Pharmacia Biotech) and for alkaline phosphatase labeling (Life 
Technologies). Alternatively, mddt may be cloned into commercially available vectors for the 

2 o production of RNA probes. Such probes may be transcribed in the presence of at least one labeled 

nucleotide (e.g., 32 P-ATP, Amersham Pharmacia Biotech). 

Additionally the polynucleotides of SEQ ID NO: 1-45 or suitable fragments thereof can be used 
to isolate full length cDNA sequences utilizing hybridization and/or amplification procedures well 
known in the art, e.g., cDNA library screening, PCR amplification, etc. The molecular cloning of such 
2 5 full length cDN A sequences may employ the method of cDNA library screening with probes using the 
hybridization, stringency, washing, and probing strategies described above and in Ausubel, supra , 
Chapters 3, 5, and 6. These procedures may also be employed with genomic libraries to isolate 
genomic sequences of mddt in order to analyze, e.g., regulatory elements. 

30 Genetic Mapping 

Gene identification and mapping are important in the investigation and treatment of almost all 
conditions, diseases, and disorders. Cancer, cardiovascular disease, Alzheimer's disease, arthritis, 
diabetes, and mental illnesses are of particular interest Each of these conditions is more complex than 
the single gene defects of sickle cell anemia or cystic fibrosis, with select groups of genes being 
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predictive of predisposition for a particular condition, disease, or disorder. For example, 
cardiovascular disease may result from malfunctioning receptor molecules that fail to clear cholesterol 
from the bloodstream, and diabetes may result when a particular individual's immune system is 
activated by an infection and attacks the insulin-producing cells of the pancreas. In some studies, 
5 Alzheimer's disease has been linked to a gene on chromosome 21 ; other studies predict a different gene 
and location. Mapping of disease genes is a complex and reiterative process and generally proceeds 
from genetic linkage analysis to physical mapping. 

As a condition is noted among members of a family, a genetic linkage map traces parts of 
chromosomes that are inherited in the same pattern as the condition. Statistics link the inheritance of 

1 o particular conditions to particular regions of chromosomes, as defined by RFLP or other markers. 

(See, for example, Lander, E. S. and Botstein, D. (1986) Proc. Natl. Acad Sci. USA 83:7353-7357.) 
Occasionally, genetic markers and their locations are known from previous studies. More often, 
however, the markers are simply stretches of DNA that differ among individuals. Examples of genetic 
linkage maps can be found in various scientific journals or at the Online Mendelian Inheritance in Man 
15 (OMIM) World Wide Web site. 

In another embodiment of the invention, mddt sequences may be used to generate hybridization 
probes useful in chromosomal mapping of naturally occurring genomic sequences. Either coding or 
noncoding sequences of mddt may be used, and in some instances, noncoding sequences may be 
preferable over coding sequences. For example, conservation of an mddt coding sequence among 

2 o members of a multi-gene family may potentially cause undesired cross hybridization during 

chromosomal mapping. The sequences may be mapped to a particular chromosome, to a specific region 
of a chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes 
(HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial PI 
constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. et aL (1997) Nat. 
25 Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet 
7:149-154.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical chromosome 
mapping techniques and genetic map data. (See, e.g., Meyers, supra , pp. 965-968.) Correlation 
between the location of mddt on a physical chromosomal map and a specific disorder, or a 

3 o predisposition to a specific disorder, may help define the region of DNA associated with that disorder. 

The mddt sequences may also be used to detect polymorphisms that are genetically linked to the 
inheritance of a particular condition, disease, or disorder. 

In situ hybridization of chromosomal preparations and genetic mapping techniques, such as 
linkage analysis using established chromosomal markers, may be used for extending existing genetic 
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maps. Often the placement of a gene on the chromosome of another mammalian species, such as 
mouse, may reveal associated markers even if the number or arm of the corresponding human 
chromosome is not known. These new marker sequences can be mapped to human chromosomes and 
may provide valuable information to investigators searching for disease genes using positional cloning 
5 or other gene discovery techniques. Once a disease or syndrome has been crudely correlated by genetic 
linkage with a particular genomic region, e.g., ataxia-telangiectasia to 1 lq22-23, any sequences 
mapping to that area may represent associated or regulatory genes for further investigation. (See, e.g., 
Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide sequences of the subject invention may 
also be used to detect differences in chromosomal architecture due to translocation, inversion, etc., 

10 among normal, carrier, or affected individuals. 

Once a disease-associated gene is mapped to a chromosomal region, the gene must be cloned in 
order to identify mutations or other alterations (e.g., translocations or inversions) that may be correlated 
with disease. This process requires a physical map of the chromosomal region containing the disease- 
gene of interest along with associated markers. A physical map is necessary for determining tbe 

is nucleotide sequence of and order of marker genes on a particular chromosomal region. Physical 

mapping techniques are well known in the art and require the generation of overlapping sets of cloned 
DNA fragments from a particular organelle, chromosome, or genome. These clones are analyzed to 
reconstruct and catalog their order. Once the position of a marker is determined, the DNA from that 
region is obtained by consulting the catalog and selecting clones from that region. The gene of interest 

20 is located through positional cloning techniques using hybridization or similar methods. 

Diagnostic Uses 

The mddt of the present invention may be used to design probes useful in diagnostic assays. 
Such assays, well known to those skilled in the art, may be used to detect or confirm conditions, 

25 disorders, or diseases associated with abnormal levels of mddt expression. Labeled probes developed 
from mddt sequences are added to a sample under hybridizing conditions of desired stringency. In some 
instances, mddt, or fragments or oligonucleotides derived from mddt, may be used as primers in 
amplification steps prior to hybridizatioa The amount of hybridization complex formed is quantified 
and compared with standards for that cell or tissue. If mddt expression varies significantly from the 

3 o standard, the assay indicates the presence of the condition, disorder, or disease. Qualitative or 

quantitative diagnostic methods may include northern, dot blot, or other membrane or dip-stick based 
technologies or multiple-sample format technologies such as PCR, enzyme-linked immunosorbent assay 
(ELISA)-like, pin, or chip-based assays. 
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The probes described above may also be used to monitor the progress of conditions, disorders, 
or diseases associated with abnormal levels of mddt expression, or to evaluate the efficacy of a 
particular therapeutic treatment The candidate probe may be identified from the mddt that are specific 
to a given human tissue and have not been observed in GenBank or other genome databases. Such a 
5 probe may be used in animal studies, preclinical tests, clinical trials, or in monitoring the treatment of 
an individual patient In a typical process, standard expression is established by methods well known in 
the art for use as a basis of comparison, samples from patients affected by the disorder or disease are 
combined with the probe to evaluate any deviation from the standard profile, and a therapeutic agent is 
administered and effects are monitored to generate a treatment profile. Efficacy 

10 is evaluated by determining whether the expression progresses toward or returns to the standard normal 
pattera Treatment profiles may be generated ova: a period of several days or several months. 
Statistical methods well known to those skilled in the art may be use to determine the significance of 
such therapeutic agents. 

The polynucleotides are also useful for identifying individuals from minute biological samples, 

15 for example, by matching the RFLP pattern of a sample's DNA to that of an individual's DNA The 
polynucleotides of the present invention can also be used to determine the actual base-by-base DNA 
sequence of selected portions of an individual's genome. These sequences can be used to prepare PGR 
primers for amplifying and isolating such selected DNA, which can then be sequenced. Using this 
technique, an individual can be identified through a unique set of DNA sequences. Once a unique ID 

2 o database is established for an individual, positive identification of that individual can be made from 
extremely small tissue samples. 

In a particular aspect, oligonucleotide primers derived from the mddt of the invention may be 
used to detect single nucleotide polymorphisms (SNPs). SNPs are substitutions, insertions and 
deletions that are a frequent cause of inherited or acquired genetic disease in humans. Methods of SNP 

2 5 detection include, but are not limited to, single-stranded conformation polymorphism (SSCP) and 

fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers derived from mddt are used to 
amplify DNA using the polymerase chain reaction (PCR). The DNA may be derived, for example, 
from diseased or normal tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause 
differences in the secondary and tertiary structures of PCR products in single-stranded form, and these 

3 o differences are detectable using gel electrophoresis in non-denaturing gels. In fSCCP, the 

oligonucleotide primers are fluorescently labeled, which allows detection of the amplimers in high- 
throughput equipment such as DNA sequencing machines. Additionally, sequence database analysis 
methods, termed in silico SNP (isSNP), are capable of identifying polymorphisms by comparing the 
sequences of individual overlapping DNA fragments which assemble into a common consensus 
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sequence. These computer-based methods filter out sequence variations due to laboratory preparation 
of DNA and sequencing errors using statistical models and automated analyses of DNA sequence 
chromatograms. In the alternative, SNPs may be detected and characterized by mass spectrometry 
using, for example, the high throughput MASSARRAY system (Sequenom, Inc., San Diego CA). 
5 DNA-based identification techniques are critical in forensic technology. DNA sequences taken 

from very small biological samples such as tissues, e.g., hair or skin, or body fluids, e.g., blood, saliva, 
semen, etc., can be amplified using, e.g., PCR, to identify individuals. (See, e.g., Erlich, H. (1992) 
PCR Technology . Freeman and Co., New York, NY). Similarly, polynucleotides of the present 
invention can be used as polymorphic markers, 
l o There is also a need for reagents capable of identifying the source of a particular tissue. 

Appropriate reagents can comprise, for example, DNA probes or primers prepared from the sequences 
of the present invention that are specific for particular tissues. Panels of such reagents can identify 
tissue by species and/or by organ type. In a similar fashion, these reagents can be used to screen tissue 
cultures for contamination. 

1 5 The polynucleotides of the present invention can also be used as molecular weight markers on 

nucleic acid gels or Southern blots, as diagnostic probes for the presence of a specific mRNA in a 
particular cell type, in the creation of subtracted cDNA libraries which aid in the discovery of novel 
polynucleotides, in selection and synthesis of oligomers for attachment to an array or other support, and 
as an antigen to elicit an immune response. 

20 Disease Model Systems Using mddt 

The mddt of the invention or their mammalian homologs may be "knocked out" in an animal 
model system using homologous recombination in embryonic stem (ES) cells. Such techniques are well 
known in the art and are useful for the generation of animal models of human disease. (See, e.g„ U.S. 
Patent Number 5,175,383 and U.S. Patent Number 5,767,337.) For example, mouse ES cells, such as 

25 the mouse 1 29/S v J cell line, are derived from the early mouse embryo and grown in culture. The ES 
cells are transformed with a vector containing the gene of interest disrupted by a marker gene, e.g., the 
neomycin phosphotransferase gene (neo; Capecchi, M.R. (1989) Science 244:1288-1292). The vector 
integrates into the corresponding region of the host genome by homologous recombination. 
Alternatively, homologous recombination takes place using the Cre-loxP system to knockout a gene of 

3 o interest in a tissue- or developmental stage-specific manner (Marth, J.D. (1996) Clin. Invest. 97:1999- 
2002; Wagner, K.U. et al. (1997) Nucleic Acids Res. 25:4323-4330). Transformed ES cells are 
identified and microinjected into mouse cell blastocysts such as those from the C57BL/6 mouse strain. 
The blastocysts are surgically transferred to pseudopregnant dams, and the resulting chimeric progeny 
are genotyped and bred to produce heterozygous or homozygous strains. Transgenic animals thus 
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generated may be tested with potential therapeutic or toxic agents. 

The mddt of the invention may also be manipulated in vitro in ES cells derived from human 
blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell lineages 
including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate into, for 
5 example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al. (1998) Science 
282:1145-1147). 

The mddt of the invention can also be used to create "knockin" humanized animals (pigs) or 
transgenic animals (mice or rats) to model human disease. With knockin technology, a region of mddt 
is injected into animal ES cells, and the injected sequence integrates into the animal cell genome. 

1 o Transformed cells are injected into blastulae, and the blastulae are implanted as described above. 

Transgenic progeny or inbred lines are studied and treated with potential pharmaceutical agents to 
obtain information on treatment of a human disease. Alternatively, a mammal inbred to overexpress 
mddt, resulting, e.g., in the secretion of MDDT in its milk, may also serve as a convenient source of 
that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55-74). 

15 

Screening Assays 

MDDT. encoded by polynucleotides of the present invention may be used to screen for 
molecules that bind to or are bound by the encoded polypeptides. The binding of the polypeptide and 
the molecule may activate (agonist), increase, inhibit (antagonist), or decrease activity of the 

2 o polypeptide or the bound molecule. Examples of such molecules include antibodies, oligonucleotides, 

proteins (e.g., receptors), or small molecules. 

Preferably, the molecule is closely related to the natural ligand of the polypeptide, e.g„ a ligand 
or fragment thereof, a natural substrate, or a structural or functional mimetic. (See, Coligan et al., 
(1991) Current Protocols in Immunology 1(2): Chapter 5.) Similarly, the molecule can be closely 

2 5 related to the natural receptor to which the polypeptide binds, or to at least a fragment of the receptor, 

e.g., the active site. In either case, the molecule can be rationally designed using known techniques. 
Preferably, the screening for these molecules involves producing appropriate cells which express the 
polypeptide, either as a secreted protein or on the cell membrane. Preferred cells include cells from 
mammals, yeast, Drosophila , or E. coli . Cells expressing the polypeptide or cell membrane fractions 

3 o which contain the expressed polypeptide are then contacted with a test compound and binding, 

stimulation, or inhibition of activity of either the polypeptide or the molecule is analyzed. 

An assay may simply test binding of a candidate compound to the polypeptide, wherein binding 
is detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. Alternatively, 
the assay may assess binding in the presence of a labeled competitor. 
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Additionally, the assay can be carried out using cell-free preparations, polypeptide/molecule 
affixed to a solid support, chemical libraries, or natural product mixtures. The assay may also simply 
comprise the steps of mixing a candidate compound with a solution containing a polypeptide, measuring 
polypeptide/molecule activity or binding, and comparing the polypeptide/molecule activity or binding to 
5 a standard. 

Preferably, an ELISA assay using, e.g., a monoclonal or polyclonal antibody, can measure 
polypeptide level in a sample. The antibody can measure polypeptide level by either binding, directly or 
indirectly, to the polypeptide or by competing with the polypeptide for a substrate. 

All of the above assays can be used in a diagnostic or prognostic context. The molecules 
10 discovered using these assays can be used to treat disease or to bring about a particular result in a 
patient (e.g., blood vessel growth) by activating or inhibiting the polypeptide/molecule. Moreover, the 
assays can discover agents which may inhibit or enhance the production of the polypeptide from 
suitably manipulated cells or tissues. 

15 Transcript Imaging and Toxicological Testing 

Another embodiment relates to the use of mddt to develop a transcript image of a tissue or cell 
type. A transcript image represents the global pattern of gene expression by a particular tissue or cell 
type. Global gene expression patterns are analyzed by quantifying the number of expressed genes and 
their relative abundance under given conditions and at a given time. (See Seilhamer et al. , 

20 "Comparative Gene Transcript Analysis," U.S. Patent Number 5,840,484, expressly incorporated by 
reference herein.) Thus a transcript image may be generated by hybridizing the polynucleotides of the 
present invention or their complements to the totality of transcripts or reverse transcripts of a particular 
tissue or cell type. In one embodiment, the hybridization takes place in high-throughput format, 
wherein the polynucleotides of the present invention or their complements comprise a subset of a 

25 plurality of elements on a microarray. The resultant transcript image would provide a profile of gene 
activity pertaining to disease detection and treatment molecules. 

Transcript images which profile mddt expression may be generated using transcripts isolated 
from tissues, cell lines, biopsies, or other biological samples. The transcript image may thus reflect 
mddt expression in vivo , as in the case of a tissue or biopsy sample, or in vitro , as in the case of a cell 

30 line. 

Transcript images which profile mddt expression may also be used in conjunction with in vitro 
model systems and preclinical evaluation of pharmaceuticals, as well as toxicological testing of 
industrial and naturally-occurring environmental compounds. All compounds induce characteristic 
gene expression patterns, frequently tamed molecular fingerprints or toxicant signatures, which are 
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indicative of mechanisms of action and toxicity (Nuwaysir, E. F. et al. (1999) Mol. Carcinog. 24:153- 
159; Steiner, S. and Anderson, N. L. (2000) Toxicol. Lett. 112-113:467-71, expressly incorporated by 
reference herein). If a test compound has a signature similar to that of a compound with known 
toxicity, it is likely to share those toxic properties. These fingerprints or signatures are most useful and 
5 refined when they contain expression information from a large number of genes and gene families. 
Ideally, a genome-wide measurement of expression provides the highest quality signature. Even genes 
whose expression is not altered by any tested compounds are important as well, as the levels of 
expression of these genes are used to normalize the rest of the expression data. The normalization 
procedure is useful for comparison of expression data after treatment with different compounds. While 

10 the assignment of gene function to elements of a toxicant signature aids in interpretation of toxicity 
mechanisms, knowledge of gene function is not necessary for the statistical matching of signatures 
which leads to prediction of toxicity. (See, for example, Press Release 00-02 from the National 
Institute of Environmental Health Sciences, released February 29, 2000, available at 
httpy/www.niehs.nih.gov/oc/news/toxchip.htm.) Therefore, it is important and desirable in 

15 toxicological screening using toxicant signatures to include all expressed gene sequences. 

In one embodiment, the toxicity of a test compound is assessed by treating a biological sample 
containing nucleic acids with the test compound. Nucleic acids that are expressed in the 
treated biological sample are hybridized with one or more probes specific to the polynucleotides of 
the present invention, so that transcript levels corresponding to the polynucleotides of the present 

2 0 invention may be quantified. The transcript levels in the treated biological sample are compared with 
levels in an untreated biological sample. Differences in the transcript levels between the two samples 
are indicative of a toxic response caused by the test compound in the treated sample. 

Another particular embodiment relates to the use of MDDT encoded by polynucleotides of the 
present invention to analyze the proteome of a tissue or cell type. The term proteome refers to the 

2 5 global pattern of protein expression in a particular tissue or cell type. Each protein component of a 

proteome can be subjected individually to further analysis. Proteome expression patterns, or profiles, 
are analyzed by quantifying the number of expressed proteins and their relative abundance under given 
conditions and at a given time. A profile of a cell's proteome may thus be generated by separating and 
analyzing the polypeptides of a particular tissue or cell type. In one embodiment, the separation is 

3 o achieved using two-dimensional gel electrophoresis, in which proteins from a sample are separated by 

isoelectric focusing in the first dimension, and then according to molecular weight by sodium dodecyl 
sulfate slab gel electrophoresis in the second dimension (Steiner and Anderson, supra) . The proteins are 
visualized in the gel as discrete and uniquely positioned spots, typically by staining the gel with an 
agent such as Coomassie Blue or silver or fluorescent stains. The optical density of each protein spot is 
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genially proportional to the level of the protein in the sample. The optical densities of equivalently 
positioned protein spots from different samples, for example, from biological samples either treated or 
untreated with a test compound or therapeutic agent, are compared to identify any changes in protein 
spot density related to the treatment. The proteins in the spots are partially sequenced using, for 
5 example, standard methods employing chemical or enzymatic cleavage followed by mass spectrometry. 
The identity of the protein in a spot may be determined by comparing its partial sequence, preferably of 
at least 5 contiguous amino acid residues, to the polypeptide sequences of the present invention. In 
some cases, further sequence data may be obtained for definitive protein identification. 

A proteomic profile may also be generated using antibodies specific for MDDT to quantify the 

10 levels of MDDT expressioa In one embodiment, the antibodies are used as elements on a microarray, 
and protein expression levels are quantified by exposing the microarray to the sample and detecting the 
levels of protein bound to each array element (Lueking, A. et al. (1999) Anal. Biochem. 270:103-1 1; 
Mendoze, L. G. et al. (1999) Biotechniques 27:778-88). Detection may be performed by a variety of 
methods known in the art, for example, by reacting the proteins in the sample with a thiol- or amino- 

1 5 reactive fluorescent compound and detecting the amount of fluorescence bound at each array element. 

Toxicant signatures at the proteome level are also useful for toxicological screening, and should 
be analyzed in parallel with toxicant signatures at the transcript level. There is a poor correlation 
between transcript and protein abundances for some proteins in some tissues (Anderson, N. L. and 
Seilhamer, J. (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be useful in the 

2 o analysis of compounds which do not significantly affect the transcript image, but which alto the 
proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to rapid 
degradation of mRNA, so proteomic profiling may be more reliable and informative in such cases. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins that are expressed in the treated biological 

2 5 sample are separated so that the amount of each protein can be quantified. The amount of each pirotein 
is compared to the amount of the corresponding protein in an untreated biological sample. A difference 
in the amount of protein between the two samples is indicative of a toxic response to the test compound 
in the treated sample. Individual proteins are identified by sequencing the amino acid residues of the 
individual proteins and comparing these partial sequences to the MDDT encoded by polynucleotides of 

30 the present invention. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins from the biological sample are incubated 
with antibodies specific to the MDDT encoded by polynucleotides of the present invention. The amount 
of protein recognized by the antibodies is quantified. The amount of protein in the treated biological 
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sample is compared with the amount in an untreated biological sample. A difference in the amount of 
protein between the two samples is indicative of a toxic response to the test compound in the treated 
sample. 

Transcript images may be used to profile mddt expression in distinct tissue types. This process 
5 can be used to determine disease detection and treatment molecule activity in a particular tissue type 
relative to this activity in a different tissue type. Transcript images may be used to generate a profile of 
mddt expression characteristic of diseased tissue. Transcript images of tissues before and after 
treatment may be used for diagnostic purposes, to monitor the progression of disease, and to monitor 
the efficacy of drug treatments for diseases which affect the activity of disease detection and treatment 
10 molecules. 

Transcript images of cell lines can be used to assess disease detection and treatment molecule 
activity and/or to identify cell lines that lack or misregulate this activity. Such cell lines may then be 
treated with pharmaceutical agents, and a transcript image following treatment may indicate the 
efficacy of these agents in restoring desired levels of this activity. A similar approach may be used to 
is assess the toxicity of pharmaceutical agents as reflected by undesirable changes in disease detection and 
treatment molecule activity. Candidate pharmaceutical agents may be evaluated by comparing their 
associated transcript images with those of pharmaceutical agents of known effectiveness. 

Antisense Molecules 

2 o The polynucleotides of the present invention are useful in antisense technology. Antisense 

technology or therapy relies on the modulation of expression of a target protein through the specific 
binding of an antisense sequence to a target sequence encoding the target protein or directing its 
expression. (See, e.g., Agrawal, S., ed. (1996) Antisense Therapeutics , Humana Press Inc., Totawa 
NJ; Alama, A. et al. (1997) Pharmacol. Res. 36(3):171-178; Crooke, S.T. (1997) Adv. Pharmacol. 

25 40:1-49; Sharma, H.W. and R. Narayanan (1995) Bioessays 17(12):1055-1063; and Lavrosky, Y. et 
al. (1997) Biochem. Mol. Med. 62(l):ll-22.) An antisense sequence is a polynucleotide sequence 
capable of specifically hybridizing to at least a portion of the target sequence. Antisense sequences 
bind to cellular mRNA and/or genomic DNA, affecting translation and/or transcription. Antisense 
sequences can be DNA, RNA, or nucleic acid mimics and analogs. (See, e.g., Rossi, J.J. et al. (1991) 

30 Antisense Res. Dev. l(3):285-288; Lee, R. et al. (1998) Biochemistry 37(3):900-1010; Pardridge, 
W.M. et al. (1995) Proc. Natl. Acad, Sci. USA 92(12):5592-5596; and Nielsen, P. E. and Haaima, G. 
(1997) Chem. Soc. Rev. 96:73-78.) Typically, the binding which results in modulation of expression 
occurs through hybridization or binding of complementary base pairs. Antisense sequences can also 
bind to DNA duplexes through specific interactions in the major groove of the double helix. 
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The polynucleotides of the present invention and fragments thereof can be used as antisense 
sequences to modify the expression of the polypeptide encoded by mddt The antisense sequences can 
be produced ex vivo , such as by using any of the ABI nucleic acid synthesizer series (Applied 
Biosystems) or other automated systems known in the art. Antisense sequences can also be produced 
5 biologically, such as by transforming an appropriate host cell with an expression vector containing the 
sequence of interest (See, e.g., Agrawal, smgra.) 

In therapeutic use, any gene delivery system suitable for introduction of the antisense sequences 
into appropriate target cells can be used. Antisense sequences can be delivered intracellularly in the 
form of an expression plasmid which, upon transcription, produces a sequence complementary to at 

1 o least a portion of the cellular sequence encoding the target protein (See, e.g. , Slater, J.E. , et al. (1 998) 

J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, K.J., et al. (1995) 9(13):1288-1296.) 
Antisense sequences can also be introduced intracellularly through the use of viral vectors, such as 
retrovirus and adeno-associated virus vectors. (See, e.g., Miller, AD. (1990) Blood 76:271; Ausubel, 
F.M. et al. (1995) Current Protocols in Molecular Biology , John Wiley & Sons, New York NY; Uckert, 
15 W. and W. Walther (1994) Pharmacol. Ther. 63(3):323-347.) Other gene delivery mechanisms include 
liposome-derived systems, artificial viral envelopes, and other systems known in the art. (See, e.g., 
Rossi, JJ. (1995) Br. Med. Bull. 51(l):217-225; Boado, R.J. et al. (1998) J. Pharm. Sci. 87(11):1308- 
1315; and Morris, M.C. et al. (1997) Nucleic Acids Res. 25(14):2730-2736.) 

20 Expression 

In order to express a biologically active MDDT, the nucleotide sequences encoding MDDT or 
fragments thereof may be inserted into an appropriate expression vector, i.e., a vector which contains 
the necessary elements for transcriptional and translational control of the inserted coding sequence in a 
suitable host. Methods which are well known to those skilled in the art may be used to construct 

2 5 expression vectors containing sequences encoding MDDT and appropriate transcriptional and 

translational control elements. These methods include in vitro recombinant DNA techniques, synthetic 
techniques, and in vivo genetic recombination. (See, e.g., Sambrook, supra , Chapters 4, 8, 16, and 17; 
and Ausubel, supra . Chapters 9, 10, 13, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express sequences 

3 o encoding MDDT. These include, but are not limited to, microorganisms such as bacteria transformed 

with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with 
yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus); 
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or 
tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 
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animal (mammalian) cell systems. (See, e.g., Sambrook, supra ; Ausubel, 1995, supra . Van Heeke, G. 

andS.M. Schuster (1989) J. BioL Chem. 264:5503-5509; Bitter, G.A. et al. (1987) Methods Enzymol. 

153:516-544; Scorer, C.A. et al. (1994) Bio/Technology 12:181-184; Engelhard, E.K. et al. (1994) 

Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945; 
5 Takamatsu, N. (1987) EMBO J. 6:307-311; Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; BrogUe, 

R. et al. (1984) Science 224:838-843; Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105; 

The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 

191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81:3655-3659; and Harrington, 

J.J. etal. (1997) Nat. Genet. 15:345-355.) Expression vectors derived from retroviruses, adenoviruses, 
10 or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for delivery of nucleotide 

sequences to the targeted organ, tissue, or cell population. (See, e.g., Di Nicola, M. et al. (1998) 

Cancer Gea Ther. 5(6):350-356; Yu, M. et al., (1993) Proc. Natl. Acad. Sci. USA 90(13):6340-6344; 

Buller, R.M. et al. (1985) Nature 317(6040):813-815; McGregor, D.P. et al. (1994) Mol. Immunol. 

31(3):219-226; and Verma, I.M. and N. Somia (1997) Nature 389:239-242.) The invention is not 
1 5 limited by the host cell employed. 

For long term production of recombinant proteins in mammalian systems, stable expression of 

MDDT in cell lines is preferred. For example, sequences encoding MDDT can be transformed into cell 

lines using expression vectors which may contain viral origins of replication and/or endogenous 

expression elements and a selectable marker gene on the same or on a separate vector. Any number of 
2 o selection systems may be used to recover transformed cell lines. (See, e.g. , Wigler, M. et al. (1 977) 

Cell 11:223-232; Lowy, I. et al. (1980) Cell 22:817-823.; Wigler, M. et al. (1980) Proc. Natl. Acad. 

Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14; Hartman, S.C. and 

R.C.Mulligan (1988) Proc. Natl. Acad. Sci. USA 85:8047-8051; Rhodes, C.A. (1995) Methods Mol. 

Biol. 55:121-131.) 

25 

Therapeutic Uses of mddt 

The mddt of the invention may be used for somatic or germhne gene therapy. Gene therapy 
may be performed to (i) correct a genetic deficiency (e.g., in the cases of severe combined 
immunodeficiency (SCID)-Xl disease characterized by X-linked inheritance (Cavazzana-Calvo, M. et 
30 al. (2000) Science 288 :669-672), severe combined immunodeficiency syndrome associated with an 
inherited adenosine deaminase (ADA) deficiency (Blaese, R.M. et al. (1995) Science 270:475-480; 
Bordignon, C. et al. (1995) Science 270:470-475), cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207- 
216; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene 
Therapy 6:667-703), thalassemias, familial hypercholesterolemia, and hemophilia resulting from Factor 
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Vm or Factor IX deficiencies (Crystal, R.G. (1995) Science 270:404-410; Verma, I.M. and Somia, N. 
(1997) Nature 389:239-242)), (ii) express a conditionally lethal gene product (e.g., in the case of 
cancers which result from unregulated cell proliferation), or (iii) express a protein which affords 
protection against intracellular parasites (e.g., against human retroviruses, such as human 
5 immunodeficiency virus (HTV) (Baltimore, D. (1988) Nature 335:395-396; Poeschla, E. et al. (1996) 
Proc. Natl. Acad Sci. USA. 93:11395-11399), hepatitis B or C virus (HBV, HCV); fungal parasites, 
such as Candida albicans and Paracoccidioides brasiliensis ; and protozoan parasites such as 
Plasmodium falciparum and Trypanosoma cruzi) . In the case where a genetic deficiency in mddt 
expression or regulation causes disease, the expression of mddt from an appropriate population of 

1 o transduced cells may alleviate the clinical manifestations caused by the genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in mddt 
are treated by constructing mammalian expression vectors comprising mddt and introducing these 
vectors by mechanical means into mddt-deficient cells. Mechanical transfer technologies for use with 
cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) ballistic gold 
1 5 particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene transfer, and (v) the 
use of DNAtransposons (Morgan, R.A. and Anderson, W.R (1993) Annu. Rev. Biocheia 62:191-217; 
Ivies, Z. (1997) Cell 91:501-510; Boulay, J-L. andR&ipon, H. (1998) Curr. Opia Biotechnol. 9:445- 
450). 

Expression vectors that may be effective for the expression of mddt include, but are not limited 
20 to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX vectors (Invitrogen, Carlsbad CA), 
PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), and PTET-OFF, 
PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). The mddt of the invention 
may be expressed using (i) a constitutively active promoter, (e.g., from cytomegalovirus (CMV), Rous 
sarcoma virus (RSV), SV40 virus, thymidine kinase (TK), or P-actin genes), (ii) an inducible promoter 

2 5 (e.g., the tetracycline-regulated promoter (Gossen, M. and Bujard, H. (1992) Proc. Natl. Acad. Sci. 

U.S.A. 89:5547-5551; Gossen, M. et al., (1995) Science 268:1766-1769; Rossi, RM.V. and Blau, 
H.M. (1998) Curr. Opin. Biotechnol. 9:451-456), commercially available in the T-REXplasmid 
(Invitrogen); the ecdysone-inducible promoter (available in the plasmids PVGRXR and PIND; 
Invitrogen); the FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promoter 

3 o (Rossi, F.M. V. and Blau, H.M. supra) , or (iii) a tissue-specific promoter or the native promoter of the 

endogenous gene encoding MDDT from a normal individual. 

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target ceils in culture and require minimal effort to optimize experimental 
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parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, F.L. and Eb, AJ. (1973) Virology 52:456-467), or by dectroporation (Neumann, E. et al. 
(1982) EMBO J. 1 :841 -845), The introduction of DNA to primary cells requires modification of these 
standardized mammalian transfection protocols. 
5 In another embodiment of the invention, diseases or disorders caused by genetic defects with 

respect to mddt expression are treated by constructing a retrovirus vector consisting of (i) mddt under 
the control of an independent promoter or the retrovirus long terminal repeat (LTR) promoter, (ii) 
appropriate RNA packaging signals, and (iii) a Rev-responsive element (RRE) along with additional 
retrovirus ds-acting RNA sequences and coding sequences required for efficient vector propagation. 

1 o Retrovirus vectors (e.g., PFB and PFBNEO) are commercially available (Stratagene) and are based on 

published data (Riviere, I. et al. (1995) Proc. Natl. Acad. Sci. U.S.A 92:6733-6737), incorporated by 
reference hereia The vector is propagated in an appropriate vector producing cell line (VPCL) that 
expresses an envelope gene with a tropism for receptors on the target ceils or a promiscuous envelope 
protein such as VSVg (Armentano, D. et al. (1987) J. Virol. 61:1647-1650; Bender, M.A et al. (1987) 
15 J. Virol. 61:1639-1646; Adam, M.A andMiller, A.D. (1988) J. Virol. 62:3802-3806; Dull, T. et al. 
(1998) J. Virol. 72:8463-8471; Zufferey, R. et al. (1998) J. Virol. 72:9873-9880). U.S. Patent Number 
5,910,434 to Rigg ("Method for obtaining retrovirus packaging cell lines producing high transducing 
efficiency retroviral supernatant") discloses a method for obtaining retrovirus packaging cell lines and 
is hereby incorporated by reference. Propagation of retrovirus vectors, transduction of a population of 

2 o cells (e.g., CD4 + T-cells), and the return of transduced cells to a patient are procedures well known to 

parsons skilled in the art of gene therapy and have been well documented (Ranga, U. et al. (1997) J. 
Virol. 71:7020-7029; Bauer, G. et al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. Virol. 
71:4707-4716; Ranga, U. et al. (1998) Proc. Natl. Acad. Sci. U.S.A 95:1201-1206; Su, L. (1997) 
Blood 89:2283-2290). 

25 In the alternative, an adenovirus-based gene therapy delivery system is used to deliver mddt to 

cells which have one or more genetic abnormalities with respect to the expression of mddt. The 
construction and packaging of adenovirus-based vectors are well known to those with ordinary skill in 
the art. Replication defective adenovirus vectors have proven to be versatile for importing genes 
encoding immunoregulatory proteins into intact islets in the pancreas (Csete, M.E. et al. (1995) 

3 o Transplantation 27:263-268). Potentially useful adenoviral vectors are described in U.S. Patent 
Number 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"), hereby incorporated by 
reference. For adenoviral vectors, see also Antinozzi, P.A. et al. (1999) Annu. Rev, Nutr. 19:511-544 
and Verma, I.M. andSomia, N. (1997) Nature 18:389:239-242, both incorporated by reference herein. . 
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In another alternative* a herpes-based, gene therapy delivery system is* used to deliver mddt to 
target cells which have one or more genetic abnormalities with respect to the expression of mddt The 
use of herpes simplex virus (HSV)-based vectors may be especially valuable for introducing mddt to 
cells of the central nervous system, for which HS V has a tropism. The construction and packaging of 
5 herpes-based vectors are well known to those with ordinary skill in the art. A rephcation-competent 
herpes simplex virus (HSV) type 1 -based vector has been used to deliver a reporter gene to the eyes of 
primates (Liu, X. et al. (1999) Exp. Eye Res.l69:385-395). The construction of a HSV-1 virus vector 
has also been disclosed in detail in U.S. Patent Number 5,804,413 to DeLuca ("Herpes simplex virus 
strains for gene transfer"), which is hereby incorporated by reference. U.S. Patent Number 5,804,413 

1 o teaches the use of recombinant HSV d92 which consists of a genome containing at least one exogenous 

gene to be transfeared to a cell under the control of the appropriate promoter for purposes including 
human gene therapy. Also taught by this patent are the construction and use of recombinant HSV 
strains deleted for ICP4, ICP27 and ICP22. For HSV vectors, see also Goins, W. F. et al. 1999 J. 
Virol. 73:519-532 and Xu, H. et al., (1994) Dev. Biol. 163:152-161, hereby incorporated by reference. 
1 5 The manipulation of cloned herpesvirus sequences, the generation of recombinant virus following the 
transfection of multiple plasmids containing different segments of the large herpesvirus genomes, the 
growth and propagation of herpesvirus, and the infection of cells with herpesvirus are techniques well 
known to those of ordinary skill in the art. 

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 

2 o deliver mddt to target cells. The biology of the prototypic alphavirus, Semliki Forest Virus (SFV), has 

been studied extensively and gene transfer vectors have been based on the SFV genome (Garoff , H. and 
Li, K-J. (1998) Curr. Opin. Biotech. 9:464-469). During alphavirus RNA replication, a subgenomic 
RNA is generated that normally encodes the viral capsid proteins. This subgenomic RNA replicates to 
higher levels than the full-length genomic RNA, resulting in the overproduction of capsid proteins 

2 5 relative to the viral proteins with enzymatic activity (e. g. , protease and polymerase). Similarly, 

inserting mddt into the alphavirus genome in place of the capsid-coding region results in the production 
of a large number of mddt RN As and the synthesis of high levels of MDDT in vector 
transduced cells. While alphavirus infection is typically associated with cell lysis within a few days, the 
ability to establish a persistent infection in hamster normal kidney cells (BHK-21) with a variant of 

3 o Sindbis virus (SIN) indicates that the lytic replication of alphaviruses can be altered to suit the needs of 

the gene therapy application (Dryga, S. A. et al. (1997) Virology 228:74-83). The wide host range of 
alphaviruses will allow the introduction of mddt into a variety of cell types. The specific transduction 
of a subset of cells in a population may require the sorting of cells prior to transduction. The methods 
of manipulating infectious cDNA clones of alphaviruses, performing alphavirus cDNA and RNA 
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transfections, and performing alphavirus infections, are well known to those with ordinary skill in the 
art 

Antibodies 

5 Anti-MDDT antibodies may be used to analyze protein expression levels. Such antibodies 

include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, and Fab fragments. For 
descriptions of and protocols of antibody technologies, see, e.g., Pound J.D. (1998) Immunochemical 
Protocols , Humana Press, Totowa, NJ. 

The amino acid sequence encoded by the mddt of the Sequence Listing may be analyzed by 

10 appropriate software (e.g., LASERGENE NAVIGATOR software, DNASTAR) to determine regions 
of high immunogenicity. The optimal sequences for immunization are selected from the C -terminus, the 
N-terminus, and those intervening, hydrophilic regions of the polypeptide which are likely to be exposed 
to the external environment when the polypeptide is in its natural conformatioa Analysis used to select 
appropriate epitopes is also described by Ausubel (1997, supra , Chapter 1 1 .7). Peptides used for 

1 5 antibody induction do not need to have biological activity; however, they must be antigenic. Peptides 
used to induce specific antibodies may have an amino acid sequence consisting of at least five amino 
acids, preferably at least 10 amino acids, and most preferably at least 15 amino acids. A peptide which 
mimics an antigenic fragment of the natural polypeptide may be fused with another protein such as 
keyhole hemolimpet cyanin (KLH; Sigma, St. Louis MO) for antibody production. A peptide 

20 encompassing an antigenic region may be expressed from an mddt, synthesized as described above, or 
purified from human cells. 

Procedures well known in the art may be used for the production of antibodies. Various hosts 
including mice, goats, and rabbits, may be immunized by injection with a peptide. Depending on the 
host species, various adjuvants may be used to increase immunological response. 

25 In one procedure, peptides about 15 residues in length may be synthesized using an ABI 43 1 A 

peptide synthesizer (Applied Biosystems) using fmoc-chemistry and coupled to KLH (Sigma) by 
reaction with M-maldmidobenzoyl-N-hydroxysuccinimide ester (Ausubel, 1995, supra) . Rabbits are 
immunized with the peptide-KLH complex in complete Freund's adjuvant The resulting antisera are 
tested for antipeptide activity by binding the peptide to plastic, blocking with 1 % bovine serum albumin 

3 o (BS A), reacting with rabbit antisera, washing, and reacting with radioiodinated goat anti-rabbit IgG. 
Antisera with antipeptide activity are tested for anti-MDDT activity using protocols well known in the 
art, including ELISA, radioimmunoassay (RIA), and immunoblotting. 

In another procedure, isolated and purified peptide may be used to immunize mice (about 100 
[ig of peptide) or rabbits (about 1 mg of peptide). Subsequently, the peptide is radioiodinated and used 
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to screen the immunized animals' B -lymphocytes for production of antipeptide antibodies. Positive 
cells are then used to produce hybridomas using standard techniques. About 20 mg of peptide is 
sufficient for labeling and screening several thousand clones. Hybridomas of interest are detected by 
screening with radioiodinated peptide to identify those fusions producing peptide-specific monoclonal 
5 antibody. In a typical protocol, wells of a multi-well plate (FAST, Becton-Dickinson, Palo Alto, CA) 
are coated with affinity-purified, specific rabbit-anti-mouse (or suitable anti-species IgG) antibodies at 
10 mg/ml. The coated wells are blocked with 1 % BSA and washed and exposed to supernatants from 
hybridomas. After incubation, the wells are exposed to radiolabeled peptide at 1 mg/ml. 

Clones producing antibodies bind a quantity of labeled peptide that is detectable above 

1 o background. Such clones are expanded and subjected to 2 cycles of cloning. Cloned hybridomas are 

injected into pristane-treated mice to produce ascites, and monoclonal antibody is purified from the 
ascitic fluid by affinity chromatography on protein A (Amersham Pharmacia Biotech). Several 
procedures for the production of monoclonal antibodies, including in vitro production, are described in 
Pound (supra) . Monoclonal antibodies with antipeptide activity are tested for anti-MDDT activity 
1 5 using protocols well known in the art, including ELIS A, RIA, and immunoblotting. 

Antibody fragments containing specific binding sites for an epitope may also be generated. For 
example, such fragments include, but are not limited to, the F(ab*)2 fragments produced by pepsin 
digestion of the antibody molecule, and the Fab fragments generated by reducing the disulfide bridges of 
the F(ab')2 fragments. Alternatively, construction of Fab expression libraries in filamentous 

2 o bacteriophage allows rapid and easy identification of monoclonal fragments with desired specificity 

(Pound, supra . Chaps. 45-47). Antibodies generated against polypeptide encoded by mddt can be used 
to purify and characterize full-length MDDT protein and its activity, binding partners, etc. 

Assays Using Antibodies 

2 5 Anti-MDDT antibodies may be used in assays to quantify the amount of MDDT found in a 

particular human ceil. Such assays include methods utilizing the antibody and a label to detect 
expression level under normal or disease conditions. The peptides and antibodies of the invention may 
be used with or without modification or labeled by joining than, either covalently or noncovalently, 
with a reporter molecule. 

3 o Protocols for detecting and measuring protein expression using either polyclonal or monoclonal 

antibodies are well known in the art Examples include ELIS A, RIA, and fluorescent activated cell 
sorting (FACS). Such immunoassays typically involve the formation of complexes between the MDDT 
and its specific antibody and the measurement of such complexes. These and other assays are described 
in Pound (supra) . 

41 

SUBSTITUTE SHEET (RULE 26) 



WO 01/62922 PCT7US01/05896 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 
description, utilize the present invention to its fullest extent. The following preferred specific 
embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder of 
the disclosure in any way whatsoever. 
5 The disclosures of all patents, applications, and publications mentioned above and below, in 

particular U.S. Ser. No. 60/185,213, U.S. Ser. No. 60/205,285, U.S. Ser. No. 60/205,232, U.S. Ser. 
No. 60/205,323, U.S. Ser. No. 60/205,287, U.S. Ser. No. 60/205,324, and U.S. Ser. No. 60/205,286, 
are hereby expressly incorporated by reference. 

10 EXAMPLES 
I. Construction of cDNA Libraries 

RNA was purchased from CLONTECH Laboratories, Inc. (Palo Alto CA) or isolated from 
various tissues. Some tissues were homogenized and lysed in guanidinium isothiocyanate, while others 
were homogenized and lysed in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life 
1 5 Technologies), a monophasic solution of phenol and guanidine isothiocyanate. The resulting lysates 
were centrifuged over CsCl cushions or extracted with chloroform. RNA was precipitated with either 
isopropanol or sodium acetate and ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In most cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was isolated 
2 o using oligo d(T)-coupled paramagnetic particles (Promega Corporation (Promega), Madison WI), 
OLIGOTEX latex particles (QIAGEN, Inc. (QIAGEN), Valencia CA), or an OLIGOTEX mRNA 
purification kit (QIAGEN). Alternatively, RNA was isolated directly from tissue lysates using other 
RNA isolation kits, e.g., the POLY(A)PURE mRNA purification kit (Ambion, Inc., Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding cDN A 

2 5 libraries. Otherwise, cDNA was synthesized and cDNA libraries ware constructed with the UNIZAP 

vector system (Stratagene Cloning Systems, Inc. (Stratagene), La Jolla CA) or SUPERSCRIPT 
plasmid system (Life Technologies), using the recommended procedures or similar methods known in 
the art. (See, e.g., Ausubel, 1997, supra . Chapters 5.1 through 6.6.) Reverse transcription was 
initiated using oligo d(T) or random primers. Synthetic oligonucleotide adapters were ligated to double 

3 o stranded cDNA, and the cDNA was digested with the appropriate restriction enzyme or enzymes. For 

most libraries, the cDNA was size-selected (300-1000 bp) using SEPHACRYL S1000, SEPHAROSE 
CL2B, or SEPHAROSE CL4B column chromatography (Amersham Pharmacia Biotech) or 
preparative agarose gel electrophoresis. cDNAs were ligated into compatible restriction enzyme sites of 
thepolylinker of a suitable plasmid, e.g., PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid 
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(Life Technologies), PCDNA2.1 plasmid (Invitrogen, Carlsbad CA), PBK-CMV plasmid (Stratagene), 
or pINCY (Incyte Genomics, Palo Alto CA), or derivatives thereof. Recombinant plasmids ware 
transformed into competent E. coli cells including XLl-Blue, XLl-BlueMRF, or SOLR from 
Stratagene or DH5a, DH10B, or ElectroMAX DH10B from Life Technologies. 

5 

II. Isolation of cDNA Clones 

Plasmids were recovered from host cells by in vivo excision using the UNIZAP vector system 
(Stratagene) or by cell lysis. Plasmids were purified using at least one of the following: the Magic or 
WIZARD Minipreps DNA purification system (Promega); the AGTC Miniprep purification kit (Edge 
10 BioSystems, Gaithersburg MD); and the QIAWELL 8, QIAWELL 8 Plus, and QIAWELL 8 Ultra 
plasmid purification systems or the R.E.AL. PREP 96 plasmid purification kit (QIAGEN). Following 
precipitation, plasmids were resuspended in 0. 1 ml of distilled water and stored, with or without 
lyqphilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 
is high-throughput format. (Rao, V.B. (1994) Anal. Biochem. 216:1-14.) Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 384- 
well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using 
PICOGREEN dye (Molecular Probes, Inc. (Molecular Probes), Eugene OR) and a FLUOROSKAN H 
fluorescence scanner (Labsystems Oy, Helsinki, Finland). 

20 

III. Sequencing and Analysis 

cDNA sequencing reactions ware processed using standard methods or high-throughput 
instrumentation such as the ABI CATALYST 800 thermal cycler (Applied Biosystems) or the PTC- 
200 thermal cycler (MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific 

2 5 Corp. , Sunnyvale C A) or the MICROLAB 2200 liquid transfer system (Hamilton). cDNA sequencing 

reactions wore prepared using reagents provided by Amersham Pharmacia Biotech or supplied in ABI 
sequencing kits such as the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit 
(Applied Biosystems). Electrophoretic separation of cDNA sequencing reactions and detection of 
labeled polynucleotides were carried out using the MEGABACE 1000 DNA sequencing system 

3 o (Molecular Dynamics); the ABI PRISM 373 or 377 sequencing system (Applied Biosystems) in 

conjunction with standard ABI protocols and base calling software; or other sequence analysis systems 
known in the art. Reading frames within the cDNA sequences were identified using standard methods 
(reviewed in Ausubel, 1997, supra . Chapter 7.7). Some of the cDNA sequences were selected for 
extension using the techniques disclosed in Example VIII. 
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IV. Assembly and Analysis of Sequences 

Component sequences from chromatograms were subject to PHRED analysis and assigned a 
quality score. The sequences having at least a required quality score were subject to various pre- 
processing editing pathways to eliminate, e.g., low quality 3' ends, vector and linker sequences, polyA 
5 tails, Alu repeats, mitochondrial and ribosomal sequences, bacterial contamination sequences, and 
sequences smaller than 50 base pairs. In particular, low-information sequences and repetitive elements 
(e.g., dinucleotide repeats, Alu repeats, etc.) were replaced by "n's", or masked, to prevent spurious 
matches. 

Processed sequences ware then subject to assembly procedures in which the sequences were 

1 o assigned to gene bins (bins). Each sequence could only belong to one bin. Sequences in each gene bin 

were assembled to produce consensus sequences (templates). Subsequent new sequences were added to 
existing bins using BLASTn (v. 1 A WashU) and CROSSMATCH. Candidate pairs were identified as 
all BLAST hits having a quality score greater than or equal to 150. Alignments of at least 82% local 
identity ware accepted into the bin. The component sequences from each bin were assembled using a 
15 version of PHRAP. Bins with several overlapping component sequences were assembled using DEEP 
PHRAP. The orientation (sense or antisense) of each assembled template was determined based on the 
number and orientation of its component sequences. Template sequences as disclosed in the sequence 
listing correspond to sense strand sequences (the "forward" reading frames), to the best determination. 
The complementary (antisense) strands are inherently disclosed herein. The component sequences 

2 o which were used to assemble each template consensus sequence are listed in Table 4, along with their 

positions along the template nucleotide sequences. 

Bins were compared against each other and those having local similarity of at least 82% were 
combined and reassembled. Reassembled bins having templates of insufficient overlap (less than 95% 
local identity) were re-split. Assembled templates were also subject to analysis by STITCHER/EXON 

2 5 MAPPER algorithms which analyze the probabilities of the presence of splice variants, alternatively 

spliced exons, splice junctions, differential expression of alternative spliced genes across tissue types or 
disease states, etc. These resulting bins were subject to several rounds of the above assembly 
procedures. 

Once gene bins were generated based upon sequence alignments, bins were clone joined based 

3 o upon clone information. If the 5' sequence of one clone was present in one bin and the 3 1 sequence from 

the same clone was present in a different bin, it was likely that the two bins actually belonged together 
in a single bin. The resulting combined bins underwent assembly procedures to regenerate the 
consensus sequences. 
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The final assembled templates were subsequently annotated using the following procedure. 
Template sequences were analyzed using BLASTn (v2.0, NCBI) versus gbpri (GenBaak version 120). 
"Hits" were defined as an exact match having from 95% local identity over 200 base pairs through 
100% local identity over 100 base pairs, or a homolog match having an E- value, i.e. a probability 
5 score, of <; 1 x 10~ 8 . The hits were subject to frameshift FASTx versus GENPEPT (GenBank version 
120). (See Table 7). In this analysis, a homolog match was defined as having an E-value of £ 1 x 10" 8 . 
The assembly method used above was described in "System and Methods for Analyzing Biomolecular 
Sequences," U.S.S.N. 09/276,534, filed March 25, 1999, and the LEFESEQ Gold user manual (Incyte) 
both incorporated by reference herein. 

10 Following assembly, template sequences were subjected to motif, BLAST, and functional 

analyses, and categorized in protein hierarchies using methods described in, e.g., "Database System 
Employing Protein Function Hierarchies for Viewing Biomolecular Sequence Data,*' U.S.S.N. 
08/812,290, filed March 6, 1997; "Relational Database for Storing Biomolecule Information," 
U.S.S.N. 08/947,845, filed October 9, 1997; "Project-Based Full-Length Biomolecular Sequence 

15 Database," U.S.S.N. 08/81 1,758, filed March 6, 1997; and "Relational Database and System for 

Storing Information Relating to Biomolecular Sequences," U.S.S.N. 09/034,807, filed March 4, 1998, 
all of which are incorporated by reference herein. 

The template sequences were further analyzed by translating each template in all three forward 
reading frames and searching each translation against the Pf am database of hidden Markov model- 

2 o based protein families and domains using the HMMER software package (available to the public from 
Washington University School of Medicine, St. Louis MO). Regions of templates which, when 
translated, contain similarity to Pfam consensus sequences are reported in Table 2, along with 
descriptions of Pfam protein domains and families. Only those Pfam hits with an E-value of <, 1 x 10" 3 
are reported. (See also World Wide Web site http://pfam.wustl.edu/ for detailed descriptions of Pfam 

2 5 protein domains and families.) 

Additionally, the template sequences were translated in all three forward reading frames, and 
each translation was searched against hidden Markov models for signal peptides using the HMMER 
software package. Construction of hidden Markov models and their usage in sequence analysis has 
been described. (See, for example, Eddy, S.R. (1996) Curr. Opia Str. Biol. 6:361-365.) Only those 

3 o signal peptide hits with a cutoff score of 1 1 bits or greater are reported. A cutoff score of 1 1 bits or 

greater corresponds to at least about 91-94% true-positives in signal peptide prediction. Template 
sequences ware also translated in all three forward reading frames, and each translation was searched 
against TMAP, a program that uses weight matrices to delineate transmembrane segments on protein 
sequences and determine orientation, with respect to the cell cytosol (Persson, B. and P. Argos (1994) J. 
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MoL Biol. 237:182-192; Persson, B. and P. Argos (1996) Protein Sci. 5:363-371.) Regions of 
templates which, when translated, contain similarity to signal peptide or transmembrane consensus 
sequences are reported in Table 3. 

The results of HMMER analysis as reported in Tables 2 and 3 may support the results of 
5 BLAST analysis as reported in Table 1 or may suggest alternative or additional properties of template- 
encoded polypeptides not previously uncovered by BLAST or other analyses. 

Template sequences are further analyzed using the bioinformatics tools listed in Table 7, or 
using sequence analysis software known in the art such as MACDNASIS PRO software (Hitachi 
Software Engineering, South San Francisco CA) and LASERGENE software (DNASTAR). Template 

1 o sequences may be further queried against public databases such as the GenBank rodent, mammalian, 

vertebrate, prokaryote, and eukaryote databases. 

The template sequences were translated to derive the corresponding longest open reading frame 
as presented by the polypeptide sequences. Alternatively, a polypeptide of the invention may begin at 
any of the methionine residues within the full length translated polypeptide. Polypeptide sequences 
15 were subsequently analyzed by querying against the GenBank protein database (GENPEPT, (GenBank 
version 121)). Full length polynucleotide sequences are also analyzed using MACDNASIS PRO 
software (Hitachi Software Engineering, South San Francisco CA) and LASERGENE software 
(DNASTAR). Polynucleotide and polypeptide sequence alignments are generated using default 
parameters specified by the CLUSTAL algorithm as incorporated into the MEGALIGN multisequence 

2 o alignment program (DNASTAR), which also calculates the percent identity between aligned sequences. 

Table 6 shows sequences with homology to the polypeptides of the invention as identified by 
BLAST analysis against the GenBank protein (GENPEPT) database. Column 1 shows the polypeptide 
sequence identification number (SEQ ID NO:) for the polypeptide segments of the invention. Column 2 
shows the reading frame used in the translation of the polynucleotide sequences encoding the 

2 5 polypeptide segments. Column 3 shows the length of the translated polypeptide segments. Columns 4 
and 5 show the start and stop nucleotide positions of the polynucleotide sequences encoding the 
polypeptide segments. Column 6 shows the GenBank identification number (GI Number) of the nearest 
GenBank homolog. Column 7 shows the probability score for the match between each polypeptide and 
its GenBank homolog. Column 8 shows the annotation of the GenBank homolog. 

30 V. Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a gene 
and involves the hybridization of a labeled nucleotide sequence to a membrane on which RN As from a 
particular cell type or tissue have been bound. (See, e.g., Sambrook, supra, ch. 7; Ausubel, 1995, 
supra , ch. 4 and 16.) 
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Analogous computer techniques applying BLAST were used to search for identical or related 
molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is 
much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer 
search can be modified to determine whether any particular match is categorized as exact or similar. 
5 The basis of the search is the product score, which is defined as: 

BLAST Score x Percent Identity 
5 x minimum {length(Seq. 1), length(Seq. 2)} 

1 o The product score takes into account both the degree of similarity between two sequences and the length 

of the sequence match. The product score is a normalized value between 0 and 100, and is calculated 
as follows: the BLAST score is multiplied by the percent nucleotide identity and the product is divided 
by (5 times the length of the shorter of the two sequences). The BLAST score is calculated by 
assigning a score of +5 for every base that matches in a high-scoring segment pair (HSP), and -4 for 

15 every mismatch. Two sequences may share more than one HSP (separated by gaps). If there is more 
than one HSP, then the pair with the highest BLAST score is used to calculate the product score. The 
product score represents a balance between fractional overlap and quality in a BLAST alignment. For 
example, a product score of 100 is produced only for 100% identity over the entire length of the shorter 
of the two sequences being compared. A product score of 70 is produced either by 100% identity and 

20 70% overlap at one end, or by 88% identity and 100% overlap at the other. A product score of 50 is 
produced either by 100% identity and 50% overlap at one end, or 79% identity and 100% overlap. 

VI. Tissue Distribution Profiling 

A tissue distribution profile is determined for each template by compiling the cDNA library 

2 5 tissue classifications of its component cDNA sequences. Each component sequence, is derived from a 

cDNA library constructed from a human tissue. Each human tissue is classified into one of the 
following categories: cardiovascular system; connective tissue; digestive system; embryonic structures; 
endocrine system; exocrine glands; genitalia, female; genitalia, male; germ cells; hemic and immune 
system; liver; musculoskeletal system; nervous system; pancreas; respiratory system; sense organs; 
30 skin; stomatognathic system; unclassified/mixed; or urinary tract. Template sequences, component 
sequences, and cDNA library/tissue information are found in the LIFESEQ GOLD database (Incyte 
Genomics, Palo Alto CA). 

Table 5 shows the tissue distribution profile for the templates of the invention. For each 
template, the three most frequently observed tissue categories are shown in column 3, along with the 
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percentage of component sequences belonging to each category. Only tissue categories with 
percentage values of > 10% are shown. A tissue distribution of "widely distributed" in column 3 
indicates percentage values of <10% in all tissue categories. 

5 VIL Transcript Image Analysis 

Transcript images are generated as described in Seilhamer et al., "Comparative Gene 
Transcript Analysis," U.S. Patent Number 5,840,484, incorporated herein by reference. 

VIII. Extension of Polynucleotide Sequences and Isolation of a Full-length cDNA 

1 o Oligonucleotide primers designed using an mddt of the Sequence Listing are used to extend the 

nucleic acid sequence. One primer is synthesized to initiate 5' extension of the template, and the other 
primer, to initiate 3' extension of the template. The initial primers may be designed using OLIGO 4.06 
software (National Biosciences, Inc. (National Biosciences), Plymouth MN), or another appropriate 
program, to be about 22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to 
15 anneal to the target sequence at temperatures of about 68°C to about 72°C. Any stretch of nucleotides 
which would result in hairpin structures and primer-primer dimerizations are avoided. Selected human 
cDNA libraries are used to extend the sequence. If more than pne extension is necessary or desired, 
additional or nested sets of primers are designed. 

High fidelity amplification is obtained by PCR using methods well known in the art. PCR is 

2 o performed in 96- well plates using the PTC-200 thermal cycler. (MJ Research). The reaction mix 

contains DNA template, 200 nmol of each primer, reaction buffer containing Mg 24 *, (NH 4 ) 2 S0 4 , and fi- 
mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme (Life 
Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for primer pair 
PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68 °C, 2 
25 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. In the 
alternative, the parameters for primer pair T7 and SK+ are as follows: Step 1: 94°C, 3 min; Step 2: 
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to determine which reactions are successful in extending the sequence. 

The extended nucleotides are desalted and concentrated, transferred to 384-well plates, 
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For 
5 shotgun sequencing, the digested nucleotides are separated on low concentration (0.6 to 0.8%) agarose 
gels, fragments are excised, and agar digested with AGAR ACE (Promega). Extended clones are 
religated using T4 ligase (New England Biolabs, Inc., Beverly MA) into pUC 18 vector (Amersham 
Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site overhangs, 
and transfected into competent E. coli cells. Transformed cells are selected on antibiotic-containing 

10 media, individual colonies are picked and cultured overnight at 37°C in 384-well plates in LB/2x 
carhenicillin liquid media. 

The cells are lysed, and DNA is amplified by PCR using Taq DNA polymerase (Amersham 
Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following parameters: Step 1 : 
94°C, 3 min; Step 2: 94°C, 15 sec; Step 3; 60°C, 1 min; Step 4: 72°C, 2 min; Step 5: steps 2, 3, and 4 

15 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA is quantified by PICOGREEN 
reagent (Molecular Probes) as described above. Samples with low DNA recoveries are reamplified 
using the same conditions as described above. Samples are diluted with 20% dimethysulfoxide (1 :2, 
v/v), and sequenced using DYENAMIC energy transfer sequencing primers and the DYENAMIC 
DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM BIGDYE Terminator cycle 

2 o sequencing ready reaction kit (Applied Biosystems). 

In like manna:, the mddt is used to obtain regulatory sequences (promoters, introns, and 
enhancers) using the procedure above, oligonucleotides designed for such extension, and an appropriate 
genomic library. 

25 IX. Labeling of Probes and Southern Hybridization Analyses 

Hybridization probes derived from the mddt of the Sequence Listing are employed for 
screening cDNAs, mRNAs, or genomic DNA. The labeling of probe nucleotides between 100 and 
1000 nucleotides in length is specifically described, but essentially the same procedure may be used 
with larger cDNA fragments. Probe sequences are labeled at room temperature for 30 minutes using a 
30 T4 polynucleotide kinase, y^P-ATP, and 0.5X One-Phor-All Plus (Amersham Pharmacia Biotech) 
buffer and purified using a ProbeQuant G-50 Microcolumn (Amersham Pharmacia Biotech). The 
probe mixture is diluted to 10 7 dpm/|ag/ml hybridization buffo: and used in a typical membrane-based 
hybridization analysis. 

The DNA is digested with a restriction endonuclease such as Eco RV and is electrophoresed 
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through a 0.7% agarose gel. The DNA fragments are transferred from the agarose to nylon membrane 
(NYTRAN Plus, Schleicher & Schuell, Inc., Keene NH) using procedures specified by the 
manufacturer of the membrane. Prehybridization is carried out for three or more hours at 68 °C, and 
hybridization is carried out overnight at 68 °C. To remove non-specific signals, blots are sequentially 
5 washed at room temperature under increasingly stringent conditions, up to 0. lx saline sodium citrate 
(SSC) and 0.5% sodium dodecyl sulfate. After the blots are placed in a PHOSPHORIMAGER cassette 
(Molecular Dynamics) or are exposed to autoradiography film, hybridization patterns of standard and 
experimental lanes are compared. Essentially the same procedure is employed when screening RNA. 

10 X, Chromosome Mapping of mddt 

The cDNA sequences which were used to assemble SEQ ID NO: 1-45 are compared with 
sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other 
implementations of the Smith- Waterman algorithm. Sequences from these databases that match SEQ 
ID NO:l-45 are assembled into clusters of contiguous and overlapping sequences using assembly 
15 algorithms such as PHRAP (Table 7). Radiation hybrid and genetic mapping data available from 

public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for Genome 
Research (WIGR), and G6n6thon are used to determine if any of the clustered sequences have been 
previously mapped. Inclusion of a mapped sequence in a cluster will result in the assignment of all 
sequences of that cluster, including its particular SEQ ID NO:, to that map location. The genetic map 

2 o locations of SEQ ID NO: 1 -45 are described as ranges, or intervals, of human chromosomes. The map 

position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome's p- 
arm. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies between 
chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in 
humans, although this can vary widely due to hot and cold spots of recombination.) The cM distances 
25 are based on genetic markers mapped by G6n6thon which provide boundaries for radiation hybrid 
markers whose sequences were included in each of the clusters. 

XI. Microarray Analysis 

Probe Preparation from Tissue or Cell Samples 

3 o Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 

polyA + RNA is purified using the oligo (dT) cellulose method. Each polyA* RNA sample is reverse 
transcribed using MMLV reverse-transcriptase, 0.05 pg/|il oligo-dT primer (21mer), IX first strand 
buffer, 0.03 units/|il RNase inhibitor, 500 nM dATP, 500 ^iM dGTP, 500 |iM dTTP, 40 \iM dCTP, 
40 \iM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse franscription 
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reaction is performed in a 25 ml volume containing 200 ng polyA + RNA with GEMB RIGHT kits 
(Incyte). Specific control polyA + RNAs are synthesized by in vitro transcription from non-coding yeast 
genomic DNA (W. Lei, unpublished). As quantitative controls, the control mRNAs at 0.002 ng, 0.02 
ng, 0.2 ng, and 2 ng are diluted into reverse transcription reaction at ratios of 1 :100,000, 1 :10,000, 
5 1:1 000, 1:100 (w/w) to sample mRNA respectively. The control mRNAs are diluted into reverse 
transcription reaction at ratios of 1:3, 3:1, 1:10, 10:1, 1:25, 25:1 (w/w) to sample mRNA differential 
expression patterns. After incubation at 37° C for 2 hr, each reaction sample (one with Cy3 and another 
with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and incubated for 20 minutes at 
85° C to the stop the reaction and degrade the RNA Probes are purified using two successive 
10 CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. (CLONTECH), Palo 
Alto CA) and after combining, both reaction samples are ethanol precipitated using 1 ml of glycogen (1 
mg/ml), 60 ml sodium acetate, and 300 ml of 100% ethanol. The probe is then dried to completion 
using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and resuspended in 14 jllI 5X SSC/0.2% 
SDS. 

15 

Microarrav Preparation 

Sequences of the present invention are used to generate array elements. Each array element is 
amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification uses 
primers complementary to the vector sequences flanking the cDNA insert Array elements are 
2 o amplified in thirty cycles of PCR from an initial quantity of 1 -2 ng to a final quantity greater than 5 |J.g. 
Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia Biotech). 

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 
slides (Corning) are cleaned by ultrasound in 0. 1 % SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR Scientific 

2 5 Products Corporation (VWR), West Chester, PA), washed extensively in distilled water, and coated 

with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 1 10°C oven. 

Array dements are applied to the coated glass substrate using a procedure described in US 
Patent No. 5,807,522, incorporated herein by reference. 1 jal of the array element DNA, at an average 
concentration of 100 ng/jxl, is loaded into the open capillary printing element by a high-speed robotic 

3 o apparatus. The apparatus then deposits about 5 hi of array element sample per slide. 

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 
Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., Bedford, MA) for 30 minutes at 60° C followed by washes in 0.2% 
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SDS and distilled water as before. 

Hybridization 

Hybridization reactions contain 9 jllI of probe mixture consisting of 0.2 ng each of Cy3 and 
5 Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The probe mixture 
is heated to 65° C for 5 minutes and is aliquoted onto the microarray surface and covered with an 1 .8 
cm 2 coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly larger 
than a microscope slide. The chamber is kept at 100% humidity internally by the addition of 140 \il of 
5x SSC in a corner of the chamber. The chamber containing the arrays is incubated for about 6.5 
10 hours at 60°C. The arrays are washed for 10 min at 45°C in a first wash buffer (IX SSC, 0.1% SDS), 
three times for 10 minutes each at 45°C in a second wash buffer (0.1X SSC), and (fried 

Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
15 Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is 
focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 
containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1.8 cmx 1.8 cm array used in the present example is scanned witha 
2 o resolution of 20 micr ometers . 

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 
Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate 
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The 

2 5 emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is 

typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 
although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 

The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to the probe mix at a known concentratioa A specific location on the 

3 o array contains a complementary DNA sequence, allowing the intensity of the signal at that location to 

be correlated with a weight ratio of hybridizing species of 1 : 1 00,000. When two probes from different 

sources (e.g., representing test and control cells), each labeled with a different fluorophore, 

are hybridized to a single array for the purpose of identifying genes that are differentially expressed, the 
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calibration is done by labeling samples of the calibrating cDNA with the two fluorophores and adding 
identical amounts of each to the hybridization mixture. 

The output of the photomultiplier tube is digitized using a 12-bit RT1-835H analog-to-digital 
(A/D) conversion board (Analog Devices, Inc., Norwood, MA) installed in an IBM-compatible PC 
5 computer. The digitized data are displayed as an image where the signal intensity is mapped using a 
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high 
signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping emission 
spectra) between the fluorophores using each fiuorophore's emission spectrum. 
10 A grid is superimposed over the fluorescence signal image such that the signal from each spot 

is centered in each element of the grid The fluorescence signal within each element is then integrated to 
obtain a numerical value corresponding to the average intensity of the signal. The software used for 
signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 

15 XII. Complementary Nucleic Acids 

Sequences complementary to the mddt are used to detect, decrease, or inhibit expression of the 
naturally occurring nucleotide. The use of oligonucleotides comprising from about 15 to 30 base pairs 
is typical in the art. However, smaller or larger sequence fragments can also be used. Appropriate 
oligonucleotides are designed from the mddt using OLIGO 4.06 software (National Biosciences) or 

2 o other appropriate programs and are synthesized using methods standard in the art or ordered from a 

commercial supplier. To inhibit transcription, a complementary oligonucleotide is designed from the 
most unique 5 9 sequence and used to prevent transcription factor binding to the promoter sequence. To 
inhibit translation, a complementary oligonucleotide is designed to prevent ribosomal binding and 
processing of the transcript. 

25 

XIII- Expression of MDDT 

Expression and purification of MDDT is accomplished using bacterial or virus-based 
expression systems. For expression of MDDT in bacteria, cDNA is subcloned into an appropriate 
vector containing an antibiotic resistance gene and an inducible promoter that directs high levels of 

3 o cDNA transcriptioa Examples of such promoters include, but are not limited to, the trp-lac (tac) 

hybrid promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator 
regulatory element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., 
BL21(DE3). Antibiotic resistant bacteria express MDDT upon induction with isopropyl beta-D- 
thiogalactopyranoside (IPTG). Expression of MDDT in eukaryotic cells is achieved by infecting insect 
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or mammalian cell lines with recombinant Autographica californica nuclear polyhedrosis virus 
(AcMNP V), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is 
replaced with cDNA encoding MDDT by either homologous recombination or bacterial-mediated 
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong 
5 polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to 
infect Spodoptera frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. 
Infection of the latter requires additional genetic modifications to baculovirus. (See e,g. , Engelhard, 
supra ; and Sandig, supra .) 

In most expression systems, MDDT is synthesized as a fusion protein with, e.g., glutathione S- 

io' transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 

affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton 
enzyme from Schistosoma iaponicum . enables the purification of fusion proteins on immobilized 
glutathione under conditions that maintain protein activity and antigenicity (Amersham Pharmacia 
Biotech). Following purification, the GST moiety can be proteolytically cleaved from MDDT at 

is specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification 
using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak 
Company, Rochester NY). 6-His, a stretch of six consecutive histidine residues, enables purification on 
metal-chelate resins (QIAGEN). Methods for protein expression and purification are discussed in 
Ausubel (1995, supra . Chapters 10 and 16). Purified MDDT obtained by these methods can be used 

20 directly in the following activity assay. 

XIV. Demonstration of MDDT Activity 

MDDT, or biologically active fragments thereof, are labeled with 125 I Bolton-Hunter reagent. 
(See, e.g., Bolton, A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules 
2 5 previously arrayed in the wells of a multi- well plate are incubated with the labeled MDDT, washed, and 
any wells with labeled MDDT complex are assayed. Data obtained using different concentrations of 
MDDT are used to calculate values for the number, affinity, and association of MDDT with the 
candidate molecules. 

Alternatively, molecules interacting with MDDT are analyzed using the yeast two-hybrid 
30 system as described in Fields, S. and 0. Song (1989) Nature 340:245-246, or using commercially 
available kits based on the two-hybrid system, such as the MATCHMAKER system (CLONTECH). 

MDDT may also be used in the PATHCALLBMG process (CuraGen Corp., New Haven CT) 
which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions 
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between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. Patent 
No. 6,057,101). 

XV. Functional Assays 
5 MDDT function is assessed by expressing mddt at physiologically elevated levels in 

mammalian cell culture systems. cDNA is subcloned into a mammalian expression vector containing a 
strong promoter that drives high levels of cDNA expression. Vectors of choice include pCMV SPORT 
(Life Technologies) and pCR3.1 (Invitrogen Corporation, Carlsbad CA), both of which contain the 
cytomegalovirus promoter. 5-10 ^ig of recombinant vector are transiently transfected into a human cell 
10 line, preferably of endothelial or hematopoietic origin, using either liposome formulations or 

electroporatioa 1-2 jxg of an additional plasmid containing sequences encoding a marker protein are 
co-transfected. 

Expression of a marker protein provides a means to distinguish transfected ceils from 
nontransfected cells and is a reliable predictor of cDN A expression from the recombinant vector. 

15 Marker proteins of choice include, e.g., Green Fluorescent Protein (GFP; CLONTECH), CD64, or a 
CD64-GFP fusion protein. Flow cytometry (FCM), an automated laser optics-based technique, is used 
to identify transfected cells expressing GFP or CD64-GFP and to evaluate the apoptotic state of the 
cells and other cellular properties. 

FCM detects and quantifies the uptake of fluorescent molecules that diagnose events preceding 

20 or coincident with cell death. These events include changes in nuclear DN A content as measured by 
staining of DNA with propidium iodide; changes in cell size and granularity as measured by forward 
light scatter and 90 degree side light scatter; down-regulation of DNA synthesis as measured by 
decrease in bromodeoxyuridine uptake; alterations in expression of cell surface and intracellular 
proteins as measured by reactivity with specific antibodies; and alterations in plasma membrane 

2 5 composition as measured by the binding of fluorescdn-conjugated Annexin V protein to the cell 

surface. Methods in flow cytometry are discussed in Qrmerod, M. G. (1994) Flow Cytometry . Oxford, 
New York NY. 

The influence of MDDT on gene expression can be assessed using highly purified populations 
of cells transfected with sequences encoding MDDT and either CD64 or CD64-GFP. CD64 and 

3 o CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions of human 

immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using 
magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Inc., Lake Success 
NY). mRNA can be purified from the cells using methods well known by those of skill in the art 
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Expression of mRNA encoding MDDT and other genes of interest can be analyzed by northern analysis 
or microarray techniques. 

XVI. Production of Antibodies 

5 MDDT substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 

Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the MDDT amino acid sequence is analyzed using LASERGENE software 
(DNASTAR) to determine regions of high immunogenicity, and a corresponding peptide is synthesized 

1 o and used to raise antibodies by means known to those of skill in the art Methods for selection of 

appropriate epitopes, such as those near the C -terminus or in hydrophilic regions are well described in 
the art (See, e.g., Ausubel, 1995, supra , Chapter 11.) 

Typically, peptides 1 5 residues in length are synthesized using an ABI 43 1 A peptide 
synthesizer (Applied Biosystems) using fmoc-chemistry and coupled to KLH (Sigma) by reaction with 
15 N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase immunogenicity. (See, e.g., 
Ausubel, supra .) Rabbits are immunized with the peptide-KLH complex in complete Freund's 
adjuvant. Resulting antisera are tested for antipeptide activity by, for example, binding the peptide to 
plastic, blocking with 1% BSA, reacting with rabbit antisera, washing, and reacting with radio- 
iodinated goat anti-rabbit IgG. Antisera with antipeptide activity are tested for anti-MDDT. activity 

2 o using protocols well known in the art, including ELIS A, RIA, and immunoblotting. 

XVII. Purification of Naturally Occurring MDDT Using Specific Antibodies 
Naturally occurring or recombinant MDDT is substantially purified by immunoaffinity 

chromatography using antibodies specific for MDDT. An immunoaffinity column is constructed by 

2 5 covalently coupling anti-MDDT antibody to an activated chromatographic resin, such as 

CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is 
blocked and washed according to the manufacturer's instructions. 

Media containing MDDT are passed over the immunoaffinity column, and the column is 
washed under conditions that allow the preferential absorbance of MDDT (e.g., high ionic strength 

3 o buffers in the presence of detergent). The column is eluted under conditions that disrupt 

antibody/MDDT binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such 
as urea or thiocyanate ion), and MDDT is collected 
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All publications and patents mentioned in the above specification are herein incorporated by 
reference. Various modifications and variations of the described method and system of the invention 
will be apparent to those sldlled in the art without departing from the scope and spirit of the invention. 
Although the invention has been described in connection with specific preferred embodiments, it should 
be understood that the invention as claimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the above-described modes for carrying out the invention which are 
obvious to those skilled in the field of molecular biology or related fields are intended to be within the 
scope of the following claims. 
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i ^•07Tfip3 1 •2nnnPPRia 

L.V3**7r ( OOJ. l.&wUrCD IO 


373 


459 


forward 1 


TM 


Nl in 

IV If 1 
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i r^»QT7fiR3 i*9nnnPPRip 

LA3.9/ / DOO. I xUUUrCD IO 


657 
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forward 3 


TM 

1 IVI 


N out 

IV UUl 


o 
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forward 3 


TM 

1 IVI 


N nut 

IV wUl 


o 
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LG'Q80m3 1'2000FFR1fl 


313 


375 


forward 1 


TM 


Nl out 


o 
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LG'9801 53 1 '2000FEB1 8 


391 


453 


forward 1 


TM 


N out 


Q 
w 


LG -9801 53 1*2000FEB18 


278 


364 


forward 2 


TM 


N out 
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LG'980153 V2000FEB18 


416 


493 


forward 2 


TM 


N out 
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LG'980153 1-2000FEB18 


809 


871 


forward 2 


TM 


N out 
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LG'980153 1"2000FEB18 


902 


964 


forward 2 


TM 


N out 

1 ' will 
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LG'980153 V2000FEB18 


1181 


1264 


forward 2 


TM 


N out 


3 
o 


1 G'Q801^ 1'2flOTfFFR1ft 


1427 


1510 


forward 2 


TM 


N nut 

IV UUl 


a 
o 


i G'Qroi*v* i*?nnnFFRiR 


1733 


1798 


forward 2 


TM 

1 IV| 


M out 

IV WUl 


q 
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i fVQftm^3 i*9onnPPRm 


1868 


1954 


forward 2 


TM 

1 IV| 


Nl out 


o 
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I G-QRom^ i # 9nnnFFRiR 


2141 


2227 


forward 2 


TM 

1 IVI 


N out 

1 V WUl 
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1 f^'QR0l^3 1-9DnnFPR1R 

LU«90w I Sw* I .cUUUrCD I o 


2261 
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forward 2 


TM 

1 IVI 


N out 

iv wui 


Q 
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L.V3.90U 1 SO* I .CUWruD 1 0 


60 
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forward 3 


TM 

I IVI 


Nl in 

IV If 1 


O 

o 


1 G*QR01^3 1*2nnnFFR1R 

LVJI.90U 1 SO* 1 .£UUUilD 1 O 


402 


476 


forward 3 


TM 

1 IVI 


N in 

IV II 1 


3 


i fi-QRoi^ i*pnnnFFRiR 

UV3.90V/ 1 SO* 1 .CUuUruD 1 O 


2031 


2081 


forward 3 


TM 

1 IVI 


Nl in 

IV II 1 
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LG"Q801^ 1'P0n0FFR1fl 


2142 


2213 


forward 3 


TM 

1 IVI 


N in 

IV II | 


c 
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I G*.i7wm i # PonnFFRiR 


2134 


2208 


forward 1 


TM 

1 IVI 


Nl in 

IV II 1 


c 
O 


i G-47^m 1-pnnnFFRiR 

U,V3.*T r SOS 1 • 1 .CUUUrCD 1 0 


2039 


2125 


forward 2 


TM 

1 IVl 


Nl nut 
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c 
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i rv 47*^^1 ionnftFPRiR 

L.Va.*fr f DDD I . I .cvUUrCD I o 


1167 
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forward 3 


TM 

1 IVl 


Nl in 

lv III 


D 
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927 


forward 1 


TM 

1 IVI 
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i r-v4AiA07 9-onnnPPRiR 


949 


1035 


forward 1 


TM 

1 IVI 
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I fV4.R1.dft7 9*9ftftftFFR1R 


1081 


1161 


forward 1 
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1 IVI 
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1510 


1584 


forward 1 
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1 IVI 




e 
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LV3.HO 1 Hwf •LiC.UuUrCD 1 O 


1355 


1435 


forward 2 


TM 

1 IVI 


Ni out 

IV WUl 


c 
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i G-4fli4n7 9'pnnnFFRiR 


1439 


1525 


forward 2 


TM 

1 IVI 


Nl out 

IV WUl 


R 
u 


I G'4R140,7 2*2000FFR18 


1326 


1409 


forward 3 
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Nl in 
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D 
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1446 


1526 


forward 3 


TM 

1 IVI 


N in 

IV II 1 
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1545 


1616 


forward 3 


TM 


N in 


7 


1 l-44^RR0 1 -P00OFFBO1 
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forward 2 


TM 
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Nl out 

IV WUl 


1 u 


I G' 171 377 1'2ftftOMAY1Q 

l_V3 * 1 r IO / / .I -*C wwwIVIM 119 


318 
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forward 3 


TM 

1 IVI 
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1 V II 1 


in 
i w 
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l_V3* 1 / 1 Of / • 1 •buUUIVln 1 1 9 
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635 


forward 3 


TM 

1 IVI 


N in 


in 

1 W 
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Lu . If 1 O/ / * 1 ••CwwwJVlr\ I 1 «7 
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forward 3 


TM 

1 IVI 


N| j n 


12 
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1381 


1461 


forward 1 


TM 


N| in 


12 
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1710 


forward 1 


TM 

I IVI 


N in 




I G , P-473R4 1 'POO0MAY1 9 


1409 


1495 


forward 2 


TM 


N in 




L G-P47384 1-P000MAY19 


1395 


1481 


forward 3 


TM 


N in 


12 


1 G'P47^R4 1*P000MAY19 


1617 


1679 


forward 3 


TM 


N in 


13 


LG*403872 1*2000MAY19 


535 


621 


forward 1 


TM 


N in 


13 


LG-403872 1 *2000MAY1 9 


1360 


1446 


forward 1 


TM 


N in 


13 


LG-403872 1 ^OOOMAYI 9 


1522 


1581 


forward 1 


TM 


N in 


13 
i w 


LG-403872 1 '2000MAY1 9 
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1902 


forward 1 


TM 


N in 


13 

1 W 


LG'403872 1'2000MAY19 


1957 


2022 


forward 1 


TM 


N in 


13 


LG-403872 1 '2000MAY1 9 
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349 


forward 2 


TM 


Nin 


13 


LG-403872 1 :2000 M AY1 9 


1361 


1423 


forward 2 


TM 


Nin 


13 


LG.403872.1 :2000MAY1 9 


1439 


1501 


forward 2 


TM 


Nin 


13 


LG:403872.1 :2000MAY19 


1553 


1627 


forward 2 


TM 


Nin 


13 


LG:403872.1 :2000MAY19 


1859 


1918 


forward 2 


TM 


Nin 


13 


LG:403872.1 :2000MAY19 


2027 


2110 


forward 2 


TM 


Nin 


13 


LG:403872.1 :2000MAY19 


2117 


2203 


forward 2 


TM 


Nin 


13 


LG.403872.1 :2000MAY19 


369 


452 


forward 3 


TM 


Nin 
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LG:403872.1 :2000MAY1 9 


549 
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forward 3 
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TM 


N in 
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forward 3 
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forward P 

1 wl WCII VJ £» 


TM 

1 IVI 


M oi it 

IN UUl 


14 


LG:1 1 3521 3.1 :2000MAY1 9 
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forward P 
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forward P 
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M out 

IN UUl 


16 


LG:3421 47. 1 :2000M AY1 9 
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204 


forward 1 
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TM 

1 IVI 


N out 

IN UUl 


16 


LG.3421 47.1 :2000MAY1 9 
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forward Q 

iui vvaiu w 


TM 

1 IVI 


M oirt 

IN UUl 


17 


LG:1 097300.1 :2000MAY1 9 


487 
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forward 1 

iwi wen w i 


TM 

1 IVI 
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LG' 1 097300 1 '2000MAY1 9 
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forward 1 
iui vvaiu i 


TM 
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forward 1 
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forward P 
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forward P 
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17 


LG: 1097300.1 :2000MAY1 9 

■ ww r www. i ifavwif in 1 1 w 
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forward P 
iui vvcii \j £— 


TM 

1 IVI 
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522 
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w » w 


forward VK 

1 wl WCII U W 


TM 
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In in 


17 


LG:1 097300 1 '2000MAY1 9 


750 


836 

www 


forward 
iui wcii u w 


TM 

1 IVI 
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In in 


17 


LG:1 097300 1 :2000MAY1 9 


894 
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www 


forward 

IUI VVCll LI w 


TM 

1 IVI 


N in 
in in 


17 


LG'1 097300 1 '2000MAY1 9 
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1 www 
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1 1 *TW 


forward *^ 
iui vvcii Li w 


TM 

1 IVI 


in 
in in 


18 


LG:444850.9'2000MAY1 9 
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315 

W 1 W 


forward 1 
iui wcii u i 


TM 

1 IVI 


hi in 

IN III 


19 


LG'402231 62000MAY19 


407 


484 
tot 


forward P 

IUI WCII LI Cm 


TM 

1 IVI 


M in 
In in 


23 


LG'350793 2*20Q0MAY1 9 


148 




fonwa rd 1 
iui wcii li i 


TM 

1 IVI 


M in 
In in 


23 


LG-350793 2*2000MAY1Q 


w l w 


oot 


forwa rH 1 

IUI Weil U I 


TM 
1 IVI 


M in 

in in 


23 


LG'350793 2-2000MAY19 


1144 


1P15 


forward 1 
iui well u i 


TM 

1 IVI 


M in 
in in 


23 
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1 IVI 
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In in 


23 
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1 www 


1 tCJ 


forward 1 
iui Well \J i 
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forward 1 
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forward 1 

i ui wcii u i 


TM 

1 IVI 


Nl in 

IN III 


23 
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forward 1 

IUI WCII LI 1 


TM 

1 IVI 


IM in 

IN III 


23 


LG:350793.2:2000MAY1 9 


1514 


1600 


forward P 

iui vvcii u Cm 


TM 

1 IVI 




23 


LG'350793 2-2000MAY19 


2135 


2221 


forward P 

iui wcii u Cm 


TM 

1 IVI 




23 
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«a»V*«.WWW # W Wlfalb wwwi Tin ■ 1 W 


1422 


1493 


forward 3 

i ui wcii \j \j 


TM 

1 IVI 




23 
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•..wtaWWW » W W • £»<£a»WWWI 1 1 W 


2268 


2354 

*— W W~ 


forward 3 

iui v»aiu u 


TM 

1 IVI 




24 


LG'408751 .3-2000MAY1 9 

%^»^ww# w ■ •WifevvviFin • i w 


1202 


1264 


forward P 

iui wcii u Cm 


TM 

1 IVI 


N oirt 

IN UUl 


24 


LG'408751 .3-2000MAY1 9 

b*M< T^WW t W 1 iWibWVIlin 1 1 W 


1137 


1223 


forward % 

iui wcii u w 


TM 

1 IVI 


IM in 

IN III 


25 


Ll'336120 1-2000MAY01 

•™I»WWW 1 fa»W* • .bWWWlVfr* IVI 


241 


297 

Cw # 


forward 1 

IUI WQIU 1 


TM 

1 IVI 


IN III 


25 


Ll:336120.1 '2000MAY01 


616 


702 


forward 1 

IUI WCIIVJ 1 


TM 

1 IVI 


Nl in 

IN II 1 


25 


Ll:3361 20.1 '2000MAY01 

«»1.WWW 1 >a.W. t lUV V Vlf l( » IWI 


1141 


1200 


forward 1 

IUI VVCII U 1 


TM 

1 IVI 


N in 

IN II 1 


25 


Ll:3361 20.1 -2000MAY01 


2524 


2598 


forward 1 

i W 1 VVul VJ 1 


TM 

1 IVI 


N in 

IN II 1 


25 


LI.3361 20. 1 :2000MAY01 


1163 


1213 


forward 2 

iwi v y ai w Cm 


TM 

1 IVI 


N in 

IN II 1 


25 


LI.3361 20.1 :2000MAY01 


1922 


1972 


forward 2 

IWI UUIU (— 


TM 

1 IVI 


N in 

1 M II 1 


25 


LI.3361 20.1 :2000MAY01 


2060 


2119 


forward 2 

IWI UUIU w 


TM 


N in 

1 V II 1 


25 


Ll:3361 20.1 :2000MAY01 


2510 


2596 


forward 2 

IWI llv(l\J b 


TM 


IS| in 


25 


Ll:3361 20.1 :2000MAY01 


663 


749 


forward 3 

IWIWUIW w 


TM 


N in 


25 


LI.3361 20.1 :2000MAY01 


1380 


1445 


forward 3 


TM 


N in 


25 


Ll:3361 20.1 :2000MAY01 


1839 


1925 


forward 3 

1 \J 1 ft UIW \tw 


TM 


Nin 


25 


LI.3361 20.1 :2000MAY01 


2148 


2234 


forward 3 

If Ul W %mW 


TM 


N in 


25 


LI336120 1-2000MAY01 


P418 


247 1 


forward 3 

IUI VVCll U *J 


TM 

1 IVI 


N in 

IN III 


25 


LI:336120.1:2000MAY01 


2499 


2585 


forward 3 


TM 


Nin 


pfi 
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TM 
1 IVI 


M nut 
IN OUI 


26 


LI:234104.2:2000MAY01 


2155 


2241 


forward 1 


TM 


Nout 


26 


LI:234104.2:2000MAY01 


3616 


3690 


forward 1 


TM 


Nout 
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tow 


Ll*2341 04 2-2000MAY01 

i>iitov~ 1 w~.e« .to v w ui tin ■ w ■ 


1112 


1168 


forward 2 


TM 


N in 


26 

cw 
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phosphoinositol 3 -phosphate-binding protem-2 
[Homo sapiens] 


TPC2 [unidentified] 1 


KIAA0969 protein [Homo sapiens] | 
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KIAA1431 protein [Homo sapiens] | 


BC37295_1 [Homo sapiens] 1 


R31665_2 [Homo sapiens] I 


1 RING-fmger protein MURF [Mus musculus] j 


hypothetical protein [Macaca fascicular is] 1 


unnamed protein product [Homo sapiens] 1 


unnamed protein product [Homo sapiens] | 


unnamed protein product [Homo sapiens] 1 


KIAA0455 protein [Homo sapiens] J 


1 MGA protein [Mus musculus] 


T-box family member; T-box domain [Cynops 
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transcription factor TBX6 (Homo sapiens] 1 


| RING finger protein [Xenopus laevis] 


estrogen-responsive B box protein [Homo 
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CLAIMS 

What is claimed is: 

1 . An isolated polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of: 

a) a polynucleotide sequence selected from the group consisting of SEQIDNO:l-45, 

b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO: 1-45, 

c) a polynucleotide sequence complementary to a), 

d) a polynucleotide sequence complementary to b), and 

e) an RNA equivalent of a) through d). 

2. An isolated polynucleotide of claim 1, comprising a polynucleotide sequence selected from 
the group consisting of SEQ ID NO:l-45. 

3. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a polynucleotide 
of claim 1. 

4. A composition for the detection of expression of disease detection and treatment molecule 
polynucleotides comprising at least one of the polynucleotides of claim 1 and a detectable label. 

5. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 1, the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain reaction 
amplification, and 

b) detecting the presence or absence of said amplified target polynucleotide or fragment 
thereof, and, optionally, if present, the amount thereof. 

6. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
comprising a sequence of a polynucleotide of claim 1 , the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization complex 
is formed between said probe and said target polynucleotide or fragments thereof, and 
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b) detecting the presence or absence of said hybridization complex, and, optionally, if present, 
the amount thereof. 



7. A method of claim 5, wherein the probe comprises at least 30 contiguous nucleotides. 

8. A method of claim 5, wherein the probe comprises at least 60 contiguous nucleotides. 

9. A recombinant polynucleotide comprising a promote sequence operably linked to a 
polynucleotide of claim 1 . 

10. A cell transformed with a recombinant polynucleotide of claim 9. 

.1 1 . A transgenic organism comprising a recombinant polynucleotide of claim 9. 

12. A method for producing a disease detection and treatment molecule polypeptide, the 
method comprising: 

a) culturing a cell under conditions suitable for expression of the disease detection and 
treatment molecule polypeptide, wherein said cell is transformed with a recombinant polynucleotide of 
claim 9, and 

b) recovering the disease detection and treatment molecule polypeptide so expressed, 

1 3. A purified disease detection and treatment molecule polypeptide (MDDT) encoded by at 
least one of the polynucleotides of claim 2. 

14. An isolated antibody which specifically binds to a disease detection and treatment molecule 
polypeptide of claim 13. 

15. A method of identifying a test compound which specifically binds to the disease detection 
and treatment molecule polypeptide of claim 13, the method comprising the steps of: 

a) providing a test compound; 

b) combining the disease detection and treatment molecule polypeptide with the test 
compound for a sufficient time and undo: suitable conditions for binding; and 
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c) detecting binding of the disease detection and treatment molecule polypeptide to the test 
compound, thereby identifying the test compound which specifically binds the disease detection and 
treatment molecule polypeptide. 

16. A microarray wherein at least one element of the microarray is a polynucleotide of claim 3. 

17. A method for generating a transcript image of a san^le which contains polynucleotides, 
the method comprising the steps of: 

a) labeling the polynucleotides of the sample, 

b) contacting the dements of the microarray of claim 16 with the labeled polynucleotides of the 
sample under conditions suitable for the formation of a hybridization complex, and 

c) quantifying the expression of the polynucleotides in the sample. 

18. A method for screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a polynucleotide sequence of claim 1 , the 
method comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, under conditions 
suitable for the expression of the target polynucleotide, 

b) detecting altered expression of the target polynucleotide, and 

c) comparing the expression of the target polynucleotide in the presence of varying amounts of 
the compound and in the absence of the compound. 

19. A method for assessing toxicity of a test compound, said method comprising: 

a) treating a biological sample containing nucleic acids with the test compound; 

b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at 
least 20 contiguous nucleotides of a polynucleotide of claim 1 under conditions whereby a specific 
hybridization complex is formed between said probe and a target polynucleotide in the biological 
sample, said target polynucleotide comprising a polynucleotide sequence of a polynucleotide of claim 1 
or fragment thereof; 

c) quantifying the amount of hybridization complex; and 

d) comparing the amount of hybridization complex in the treated biological sample with the 
amount of hybridization complex in an untreated biological sample, wherein a difference in the amount 
of hybridization complex in the treated biological sample is indicative of toxicity of the test compound. 
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20. An array comprising different nucleotide molecules affixed in distinct physical locations on 
a solid substrate, wherein at least one of said nucleotide molecules comprises a first oligonucleotide or 
polynucleotide sequence specifically hybridizable with at least 30 contiguous nucleotides of a target 
polynucleotide, said target polynucleotide having a sequence of claim 1 . 

5 

21 . An array of claim 20, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to at least 30 contiguous nucleotides of said target polynucleotide. 

22. An array of claim 20, wherein said first oligonucleotide or polynucleotide sequence is 
10 completely complementary to at least 60 contiguous nucleotides of said target polynucleotide 

23. An array of claim 20, which is a microarray. 

24. An array of claim 20, further comprising said target polynucleotide hybridized to said first 
1 5 oligonucleotide or polynucleotide. 

25. An array of claim 20, wherein a linker joins at least one of said nucleotide molecules to 
said solid substrate. 

2 o 26. An array of claim 20, wherein each distinct physical location on the substrate contains 

multiple nucleotide molecules having the same sequence, and each distinct physical location on the 
substrate contains nucleotide molecules having a sequence which differs from the sequence of 
nucleotide molecules at another physical location on the substrate. 

25 27. An isolated polypeptide comprising an amino acid sequence selected from the group 

consisting of: 

a) an amino acid sequence selected from the group consisting of SEQ ID NO:46-90, 

b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO:46-90, 

30 c) a biologically active fragment of an amino acid sequence selected from the group 

consisting of SEQ ID NO:46-90, and 

d) an immunogenic fragment of an amino acid sequence selected from the group consisting 
ofSEQIDNO:46-90. 
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<110> INCYTE GENOMICS, INC. 
PANZER , Scott R. 
SPIRO, Peter A. 
BANVILLE, Steven C. 
SHAH, Purvi 
CHALUP, Michael S. 
CHANG, Simon C. 
CHEN, Alice 
D'SA, Steven A. 
AMSHEY, Stefan 
DAHL, Christopher R. 
DAM, Tam C. 
DANIELS, Susan E. 
DUFOUR, Gerard E. 
FLO RES , Vincent 
FONG, Willy T. 
GREENAWALT, Li la B. 
HILLMAN, Jennifer L. 
JONES, Anissa L. 
LIU, Tommy F. 
ROSEBERRY, Ann M. 
ROSEN, Bruce H. 
RUSSO, Frank D . 
STOCKDREHER, Theresa K. 
DAFFO, Abel 
WRIGHT, Rachel J. 
YAP, Pierre E. 
YU, Jimmy Y. 
BRADLEY, Diana L. 
BRATCHER, Shawn R. 
CHEN, Wensheng 
COHEN, Howard J. 
HODGSON, David M. 
LINCOLN, Stephen E. 

<120> MOLECULES FOR DISEASE DETECTION AND TREATMENT 

<130> PT-1133 PCT 

<140> To Be Assigned 
<141> Herewith 

<1S0> 60/185,213/ 60/205,285; 60/205,232; 60/205,323; 60/205,287; 

60/205,324; 60/205,286 
<151> 2000-02-24; 2000-05-17; 2000-05-16; 2000-05-17; 2000-05-17; 

2000-05-17; 2000-05-17 

<160> 90 

<170> PERL Program 

<210> 1 

<211> 1378 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 977683 . 1 : 2000FEB18 
<220> 

<221> unsure 
<222> 1355 

<223> a, t, c, g, or other 
<400> 1 

caggagatgg cggcggcggc ggctagggat cagacatggc ggcggatctg aacctggagt 60 
ggatctccct gccccggtcc tggacttacg ggatcaccag gggcggccga gtcttcttca 120 
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tcaacgagga ggccaagagc accacctggc tgcaccccgt caccggcgag gcggtggtca 180 
ccggacaccg gcggcagagc acagatttgc ctactggctg ggaagaagca tatacttttg 240 
aaggtgcaag atactatata aaccataatg aaaggaaagt gacctgcaaa catccagtca 300 
caggacaacc atcacaggac aattgtattt ttgtagtgaa tgaacagact gttgcaacca 360 
tgacatctga agaaaagaag gaacggccaa taagtatgat aaatgaagct tctaactata 420 
acgtgacttc agattatgca gtgcatccaa tgagccctgt aggcagaact tcacgagctt 480 
caaaaaaagt tcataatttt ggaaagaggt caaattcaat taaaaggaat cctaatgcac 540 
cggttgtcag acgaggttgg ctttataaac aggacagtac tggcatgaaa ttgtggaaga 600 
aacgctggtt tgtgctttct gacctttgcc tcttttatta tagagatgag aaagaagagg 660 
gtatcctggg aagcatactg ttacctagtt ttcagataag ctttgcttac cctctgaaga 720 
tcacattaat cgcaaatatg cttttaaggc agcccatcca aacatgcgga cctattattt 780 
ctgcactgat acaggaaagg aaatggagtt gtggatgaaa gccatgttag atgctgccct 840 
agtacagaca gaacctgtga aaagagtgga caagattaca tctgaaaatg caccaactaa 900 
agaaaccaat aacattccca accatagagt gctaattaaa ccagagatcc aaaacaatca 960 
aaaaaacaag gaaatgagca aaattgaaga aaaaaaggca ttagaagctg aaaaatatgg 1020 
atttcagaag gatggtcaag atagaccctt aacaaaaatt aatagtgtaa agctgaattc 1080 
tctgccatct gaatatgaga gtgggtcagc atgccctgct cagactgtgc actacagacc 1140 
aatcaacttg agcagttcag agaacaaaat agtcaatgtt agcctggcag atcttagagg 1200 
tggaaatcgc cccaatacag ggcccttata cacagaggcc gatcgagtca tacagagaac 1260 
aaattcaatg cagcagttgg aacagtggat taaaatccag aaggggaggg gtcatgaaga 1320 
agaaaccagg ggagtaattt cttaccaaac attancaaga aatatgccaa gtcacaga 1378 

<210> 2 

<211> 662 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_jf eature 

<223> Incyte ID No: LG : 893 050 . 1 : 200 0FEB18 • 
<400> 2 

gggtcttaga gtttaccttc tacttccttt agagtgtctt cgcttttctc agggcacttg 60 
gaggtcctaa aactgctggt ggcacgggga gcagacctcg gctgcaaggc ccgcaagggc 120 
tatgggctgc tccatacagc tgctgccagt ggccagattg aagtggtgaa gtacctgctt 180 
cggatgggag cggagatcga tgaacccaat gcttttggaa acacagcttt gcacatcgcc 240 
tgctacctgg gccaggatgc tgtggctatt gagctggtga atgccggagc caatgtcaac ' 3 00 
cagccgaatg acaagggctt cacgccactg catgtggctg cagtctcgac caatggcgct 3 60 
ctctgcttgg agctactggt taataatggg gctgacgtca actaccagag caaagaaggg 420 
aaaagtcctc tgcacatggc tgcaatccat ggccgtttca cacgctccca gatcctcatc 480 
cagaatggca gcgagattga ttgtgccgac aaatttggga acacgccact gcatgtggct 540 
gctcgatatg gacacgagct gctcatcagc accctcatga ccaatggcgc agataccggc 600 
cggcgtggca tccatgacat gttccccctg cacttagctg ttctctttgg attctctgac 660 
tg 662 

<210> 3 

<211> 2764 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LG : 980153 . 1 : 2000FEB18 
<220> 

<221> unsure 
<222> 2663 

<223> a, t, c, g, or other 
<400> 3 

ccccgttccc gattcatgta gtagcggctg tattgcagcc gcctgccgaa ctgacccggg 60 
tctggggact ggcccctctg gcgccgttcg gtttctctta ttgccttcac tgaggatgag 120 
tccctttgtg gctctatgtg gaccctgcgg aatccaccgg cgcagtttca tctagcgact 180 
ggtcaccctt gcaattatgg atatttaaaa gggtcagaca gtgtggaggg ggagttcccc 240 
tcctcactcc cccttggtgc ttgactccag gaataattta taaactgtgg aattttttta 3 00 
aatgaagaac ttgtatttga tatgaacttt atagagctat ttataatttt tttgatttaa 3 60 
gtgccaaaaa aattgtataa agatatatag ttttatacta ttgtcaggag gatttaaatt 420 
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atcctaaaaa ggtaatttat tctctgtaac ttcctcaata gcacctttgt gtcctggctt 480 

tttcattttt taaaattagt tttcacgatt ctgaagtaag tggtataaaa acagttagga 540 

tgagttcacc catgcctgac tgcacatcaa agtgtcgatc cctgaagcat gctttggaag 600 

tcccttctgt ggtaacaaag gggagcgaaa acccgattaa ggcccttctc tccacgtcat 660 

tgttacaaag ctgccactat caaggatgtt tttggcagga atgccctcca cccctgtttc 720 

ctcctcgtgg agaagaaagg agtgttagat tggcttattc agaaaggagt ggatctgttg 780 

gtgaaagaca aagagtctgg atggacagca ttgcacagaa gcatttttta tggacatatt 840 

gattgtgttt ggtctctatt gaagcatggt gttagtctgt atattcaaga taaagaaggc 900 

ttgtcagctt tggatcttgt aatgaaggat agaccaactc atgtagtatt caagaatact 9 60 

gatcctacag atgtttatac ttggggcgat aatacaaatt ttaccctggg tcatggaagc 1020 

cagaatagca aacatcatcc agagttggtg gatctgttct ccaggagtgg gatttatatc 1080 

aagcaggtgg tgctttgtaa atttcactcc gtgtttctgt ctcagaaagg gcaggtttat 1140 

acctgtggtc atggtcctgg agggcgatta ggacatggag atgaacagac atgcttggtc 1200 

cctcggcttg tggaaggact gaatggtcat aattgttccc aagtggcagc tgctaaggat 1260 

catactgttg tattaactga agatggatgt gtttatacat ttggtctaaa catttttcat 1320 

caattaggaa ttattccacc gccttccagt tgtaatgtac ccagacagat acaggcaaaa 1380 

tatctgaaag gaaggacaat cattggcgtt gcagcaggca ggtttcatac agtcctatgg 1440 

actagagaag ctgtttacac tatgggacta aatggtggac aactgggttg tttgctagat 1500 

cccaatggag aaaagtgtgt aactgctcct cgtcaggtct ctgcccttca ccataaagac 1560 

attgctctgt ctttggttgc tgcaagtgat ggagctacag tctgtgttac cacaagggga 1620 

gatatttact tacttgcaga ctatcagtgc aagaagatgg cttctaaaca gttgaacttg 1680 

aaaaaagttc ttgtgtctgg gggtcatatg gaatacaagg ttgatcctga acatttgaaa 1740 

gaaaatgggg gtcaaaaaat ttgcattctt gcaatggatg gagctggaag ggtgttttgc 1800 

tggagatcag tcaacagttc tctgaagcag tgtcgatggg cctatccacg tcaggtcttc 1860 

atttctgata ttgctttaaa tagaaatgaa attctatttg ttacgcaaga tggagaagga 1920 

tttagaggga gatggtttga agagaaaaga aagagttctg aaaagaaaga gattttatca 1980 

aaccttcaca attcctcatc agatgtgtct tatgtctctg atataaatag tgtgtatgaa 2040 

agaattcgac ttgagaaact tacctttgca catagagctg ttagtgtcag cacagatcca 2100 

agtggatgca actttgcaat cctgcagtca gatcctaaaa caagccttta tgaaattcca 2160 

gctgtgtcct catcatcctt ttttgaagag tttggcaaac tgttgaggga agcagatgaa 2220 

atggacagca ttcatgatgt gacatttcaa gttggcaata gactcttccc tgcacataaa 2280 

tatattttgg cagtgcattc tgattttttt cagaaattgt ttctttcaga tggtaatact 2340 

tcagaattta cagatattta ccagaaagat gaagattctg cagggtgcca tctctttgtg 2400 

gtagagaagg ttcatcctga catgtttgaa taccttttac aatttatata cacagatact 2460 

tgtgactttt taactcatgg cttcaaacca agaatacact taaacaaaaa cccagaagaa 2520 

tatcagggaa ctctgaattc tcatttgaat aaagtgaatt tccatgaaga tgataaccag 2580 
aagtctgcat ttgaagttta caaaagtaat caagctcaaa cagttagtga gaggcagaag' 2640 

agcaaaccta aatcttgtaa aanaggaaaa aatattaggg aagatgatcc tgtaagaatg 27 00 

ttgcaaactg ttgcaaagaa attcgacttc agtaatttga gtagtaggtt agatggagtc 2760 

agat 27 64 

<210> 4 

<211> 388 , 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LG:350398 . 1 :2000FEB18 
<220> 

<221> unsure 
<222> 125 

<223> a, t, c, g, or other 
<400> 4 

cccttctacg tccgctgcat caagcccaat gaggacaagg tagctgggaa gctggatgag 60 

aaccactgtc gccaccaggt cgcatacctg gggctgctgg agaatgtgag ggtccgcagg 120 

gctgnttcgc ttcccgccag ccctactctc gattcctgct caggtactgg cacctgacac 180 

ccatcactcc atgggccata gtccctgtgt ggagtccaag gggtaggagc agagggtccc 240 

caaacagcac gtcgcaaaca tcgatacaag caggaaccag cacgctgctg gcctcaagac 300 

accaaaatat ctgggaagac atgtgtgtga gcacatgcat gtggggacat acaggtggga 360 

acatgggtat gagggctgtg tgaggaca 388 

<210> 5 
<211> 2364 
<212> DNA 
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<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 47 5551.1: 200 0FEB18 
<220> 

<221> unsure 
<222> 424-425 

<223> a, t, c, g, or other 
<400> 5 

gtctgcaggg ccagagcggg gcagacatgg acaagcgggt gaagaagctt cccctcatgg 60 
ctctgtccac cacgatggct gagagcttca aggagctgga ccctgattcc agcatgggga 120 
aggccttgga gatgagctgt gccatccaga atcagctggc ccgcatcctg gccgagtttg 180 
agatgaccct ggagagggac gtcctgcagc cactcagcag gctgagtgag gaggagctgc 240 
cagccatcct caaacacaag aaaagcctcc agaagctcgt gtccgactgg aacacactca 300 
agaacaggct cagtcaggca accaagaatt caggcagcag tcaaggccta ggaggcagcc 3 60 
cgggtagtca cagccatacg accatggcca acaaggtgga gacgctgttc tactgcagca 420 
ggtnntcacc caggaaagtg gagcaatgca gggacgagta cttggctgac ctgtaccact 480 
ttgttaccaa ggaggactcc tatgccaact acttcattcg tctcctggag attcaggccg 540 
attaccatcg caggtcactg agctcgctgg acacagccct ggctgagctg agggagaacc 600 
acggccaagc agaccactcc ccttcgatga cagccaccca cttccccagg gtgtatgggg 660 
tgtcgctggc aacccacctg caagagctgg gccgggagat tgccctgccc atcgaggcct 720 
gcgtcatgat gctgctttct gagggcatga aggaagaggg tctcttccgt ctggctgctg 780 
gggcctcggt gctgaagcgt ctcaagcaga caatggcctc ggacccccac agcctggagg 840 
agttctgctc cgacccgcac gctgtggcag gtgccctcaa gtcctatctg cgggagctgc 900 
cagagcctct gatgaccttc gacctctatg atgactggat gagggcagcc agcctgaagg 960 
agccaggggc ccggctgcag gccctccaag aggtgtgcag ccgcctaccc cccgagaacc 1020 
tcagcaacct caggtacctg atgaagttcc tggcacggct ggccgaggag caggaggtga 1080 
acaagatgac acccagcaac atcgccatag tcctgggacc caacttgctg tggccacctg 1140 
agaaagaagg ggaccaggcc cagctggatg cagcctccgt gtcttccatc caggtggtgg 1200 
gcgtcgtcga ggcgctgatc cagagcgcag acaccctctt ccctggagac atcaacttca 1260 
acgtgtcagg cctcttctca gctgttaccc tccaggacac agtcagtgac aggctggcct 1320 
ctgaggaact tccgtccact gccgtgccca ccccagccac caccccggct ccggctccgg 13 80 
ctccagctcc agctccggcc ccagccttgg cttcagcagc taccaaggaa aggacagagt 1440 
ctgaggtgcc tcccagacca gcctccccca aggtcaccag gagtcccccg gagacagctg 1500 
ccccagtgga ggacatggct cggaggacca agcgcccggc gccagcccgg cccaccatgc 1560 
cgccccccca ggtctccggc tcccgc.tcct cccctccagc cccgcccttg ccccctggct 1620 
ctggcagccc tgggaccccc caagccctgc cccgacgtct ggttggcagc agcctccgag 1680 
cccccacagt gccacccccg ttacccccca, caccccctca gcctgcccgg cgccaaagcc 1740 
ggcgttcacc agcctccccc agcccggcct ccccaggtcc agcctccccc agcccagtct 1800 
ctttgagtaa ccctgcacag gtggacctgg gggctgccac agcagaggga ggagcccctg 1860 
aggctatcag tggggtcccc actcccccag ctatcccccc tcagccccgc cccaggagcc 1920 
ttgcctcaga gaccaactga gtggctggtt tctccctaag cagccctcag caccccctcc 1980 
ctccccacct ggccctccca ggacagctct cgccccccac aaaggggcat gggcctccag 2040 
cctttgccca caagtgcctc agtgcccact gggtcggccc ccatggccag gagggctcag 2100 
gacaatcctc tatttcctga ccttttcctc gtccaccctg ggcttgggga cccccccacc 2160 
ggactctcca ctctccggca ggtcctaggg gagccaccgg aaggaaggag aggtttgcct 2220 
gctcctacgg gactgattct tctcttgccg acatgttttt tgtaaggctg gtaaataaat 2280 
tattttggac aaaactggag cagctgccca aatgatagtt ttattttctg tccttgaaat 2340 
aaagaagcca attttataaa gggg 23 64 

<210> 6 

<211> 1801 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 481407 . 2 : 2000FEB18 
<400> 6 

cgccttcggg gcccggatct caaacagtcg ggaagaagca ccgtggctgc tattatctgc 60 
tctccgcgcc tgacccctcc caggactcgt gatgccaagg ccgctgcgag cggctacgaa 120 
gagtcggggt tgagccccag ctgagccgag ggctcgcact cttctggtct cccaggccca 180 
acccacctga agaaatgagt ggtggattgg ctccaagtaa gagcacagtg tatgtatcca 240 
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acttgccttt 
ttgtaaaggt 
ttttattttt 
tatttggtag 
tccgaaggcg 
taagttatgc 
aaaaagaaaa 
gaagatgaag 
gccaaaattg 
gatgattcaa 
agtgattaaa 
gaagagctaa 
tcatcccaag 
ttcatgcctg 
aagaccgaaa 
tgtctttaaa 
aattgtaaat 
taaaaggctt 
cattccctag 
tcagtcactt 
cctcaatatt 
aataaccaag 
cctagagtta 
cttcccattc 
gctttgtata 
aaaagcaata 

g 



ttccctgaca 
taccatcatg 
ggataaagac 
agtgataaaa 
aaactacttt 
ctgtccgaaa 
aaaagaaagc 
gggaggatcc 
aagaagaaca 
gacgcccaag 
atcttgcccc 
ttttaataaa 
aaaaaagata 
atttgtttac 
agttgaatga 
tatgggtgtt 
acctataaaa 
aaaatcaaac 
aagaaactac 
aataaaagaa 
cccaatgtgg 
taaatacttt 
ataattgtct 
tgtctgcagg 
tttctatatt 
gaaataagtt 



aacaatgact 
aaagataaag 
tctgcacaaa 
gcaagcattg 
gataaatcta 
aatatgctcg 
tcctgaacca 
tgctcttgac 
aaaaaaatgg 
gataaagaaa 
agcacagtaa 
aataagaaaa 
ttttaaaaat 
atgaggactt 
tttattgtta 
ttgtcatcat 
taaattataa 
acaggtgtta 
ttacatgtgc 
taaaggagaa 
ctgtattaaa 
tcatttctga 
attttttttc 
tgggtatatg 
gtttaacttt 
aatactatca 



tgtaccggat 
ataccaggaa 
actgtaccag 
ctattgacaa 
agtgttatga 
gagaacgtga 
gaagaagaaa 
agcctcagtc 
aaacccagt't 
agcacatatt 
taaaaatcaa 
attaatacta 
ttatttgaaa 
ctgaaattaa 
cttatattaa 
atttcttgta 
atacctataa 
tttgtctgcc 
acacatgtag 
ttattcaatc 
aatttggggg 
gccaaggaat 
catgtattgt 
gtaatgggat 
gtaagaatgc 
tagtaatatt 



attttccaag 
gagtaaaggg 
ggcaataaac 
tggaagagca 
atgtggggaa 
gcctccaaag 
ttgaggaagt 
aggccatagc 
caggagtccc 
tcagtgatga 
gatttgttag 
tcatgttaat 
agttcattat 
tccttaaaac 
taaacttttc 
gctttatccc 
aatataaagt 
ctacccatag 
ctaaataaag 
tcttatactt 
aatccatata 
atgctatgat 
ctttgtattt 
tagagagcct 
ccattacttt 
attattgtca 



tatggcaaag 
gttgcattta 
aacaaacagt 
gctgagttca 
agtggacact 
aagaaagaaa 
agaagaaagt 
attccagcaa 
ctcaacatca 
ggaagaactt 
taacaatctt 
actattattg 
aagggcttta 
aaacttcctg 
aagagaattt 
aatctggata 
aacatagctc 
caccaaattc 
tacatatatt 
ctcctacctt 
tatctttttt 
tacgtttttt 
atgactaaat 
ttaattttct 
tttaactagt 
tcccgaggcg 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1801 



<210> 7 

<211> 730 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



<220> 

<221> unsure 
<222> 44 
<223> a, t, c, 



LI: 443580. 1:2000FEB01 



g, or other 



<400> 7 

ggaggtgaga 

atgactggaa 

agtgaacaca 

atctccattc 

ctacaagcag 

gaggaatggg 

aatttcagca 

ttggaacaag 

gatctggaat 

gtaataatta 

caaaaaacta 

caaaactcac 

aaggaaaaaa 



tattttggtc 
agggttagga 
acctttcccc 
ctcatgtgtg 
tatccaaggg 
aatgcctgga 
acctggtttc 
gaaaagagcc 
caatgtgtga 
cccgtgaaga 
tgagtgaaga 
aatttatcca 



cccaggagaa 
aacctctctg 
tgagccactg 
gtgtcaccca 
gccggtgatg 
cgctgatcag 
agtgggactt 
ctggatggtt 
gaccaaaata 
catgtctact 
gaaaccatgg 
acatcagaga 



ctggctcagg 
aggtctggtc 
gaattggaca 
gaagaggagg 
ttcagggatg 
atgaatttat 
tccaattcta 
gatagagagc 
ttatctctaa 
tttattcagc 
gaatgtaaga 
attcattttg 



tctncaagtt 
agattccaac 
gaatgcccca 
aaagaatgca 
tttccataga 
acaaagaagt 
agccagctgt 
tgactagagg 
agaagagaca 
ccacatttct 
tatgtggaaa 
gtgaaaaaca 



cccatccggg 60 
cctggacagc 120 
ttctcctctg 180 
tgatgaactt 240 
cttctctcaa 300 
gatgttggag 3 60 
gatctcctta 420 
cctgtgttca 480 
tttcagtcaa 540 
tattccacct 600 
gacctttaat 660 
ctatgaatct 720 
730 



<2.10> 8 

<211> 457 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No; LI : 803015 . 1 : 2000FEB01 



<400> 8 
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gcgcgggctg cggctgggat ccggtctttc cagccccgag agggacctgg ttcctctgcc 60 
caggcttctg tcactctgtc acctacgcta tgccctgctg tagtcacagg aggtgtagag 120 
aggaccccgg gacatctgaa agccaggaaa tggacccagt ggcctttgat gatgttgctg 180 
tgaacttcac ccaggaggag tgggctttgc tggatatttc ccagaggaaa ctctacaagg 240 
aagtgatgct ggaaactttc aggaacctga cctctgtagg aaaaagttgg aaagaccaga 300 
acattgaata tgagtaccaa aaccccagga gaaacttcag gagtctcata gaaaagaaag 3 60 
tcaatgaaat taaagatgac agtcattgtg gagaaacttt tacccaggtt ccagatgaca 420 
ggctgaactt ccaggagaag aaagcttctc ctgaaat 457 

<210> 9 

<211> 582 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG; 027410 . 3 : 2000MAY19 
<400> 9 

ggcacaccga ggctcggccg ccccgccgcg agtcctggat cagtgacatt cgagcaggaa 60 
ccgccccttc atgcaggaac cacatcaaat caagctgcag cttgatcgcc ttcaactccg 120 
accgtcctgg tgtactgggc attgtgcctc tgcaaggcca aggagaggac aagcgacgcg 180 
tggcccacct gggctgccat tcagacctag tcaccgactt ggacttctcg ccctttgatg 240 
acttcctcct ggccacaggc tcggctgaca ggacggtaaa actctggcga ctgccagggc 300 
ctggccaggc cctgccctca gcacccgggg tggtgctggg ccccgaggac ctcccagtgg 360 
aggtactgca gttccacccc acctctgacg gcattctgag ctggcagccc atggggacct 420 
ggtgcagagc gccgtctgga gccgagatgg agccctggtg ggcacggcgt gcaaggacaa 480 
gcagctgcgg atctttgacc ccagaacaaa gccgcgggcc tctcagagca cgcaggccca 540 
tgagaacagc agggatagcc ggctggcatg gatgggcacc tg 582 

<210> 10 

<211> 848 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 171377 . 1 :2000MAY19 
<400> 10 

agcggccgca gcctctgaga gcacgaacag cagcgccccc gcgtcccagc cagccagcca 60 
gccagactgg actccggccc accgacggcc gctcgcgctc cggccccgct cgcctgctct 120 
gccccggacc tgcagctccc cgctcccccg ccgtgtccgc cgcctcccgg ccagagagcc 180 
aagcccccac gccgcgccca gccgtcgccg cgccgagcat gtcctcgacc gagaggcgcc 240 
cggcgggacg gcgggacagg tcgccgcgcc agcaggtgga ccgcctactc gtggggctgc 3 00 
getggcggcg gctggaggag cegctgggct tcatcaaagt tctccagtgg ctctttgcta 3 60 
ttttcgcctt cgggtcctgt ggctcctaca gcggggagac aggagcaatg gttcgctgca 420 
acaacgaagc caaggacgtg agctccatca tcgttgcatt tggctatccc tgcaggttgc 480 
accggatcca atatgagatg cccctctgcg atgaagagtc cagctccaag accatgcacc 540 
tcatggggga cttctctgca cccgccgagt tcttcgtgac ccttggcatc ttttccttct 600 
tctataccat ggctgcccta gttatctacc tgcgcttcca caacctctac acagagaaca 660 
aacgcttccc gctggtggac ttctgtgtga ctgtctcctt caccttcttc tggctggtag 720 
ctgcagctgc ctggggcaag ggcctgaccg atgtcaaggg ggccacacga ccatccagct 780 
tgacagcagc catgtcagtg tgccatggag aggaagcagt gtgcagtgcc ggggccacgc 840 
cctctatg 848 

<210> 11 

<211> 636 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 352559 . 1 :2000MAY19 
<400> 11 

tgtagtttcc tcaactactg cctcagctct acaatcccag agtaaagctc ttctccaaat 60 
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gaagagccag gaagaggtag 
ttcactgact ttcacagatg 
tcttgctcag agaagtttgt 
agtgggtctt tgcatttcta 
ttgggtgata aaaggaggga 
gaccaaatca ttatctttaa 
aatggaaaga cttaaaagct 
tgaagacttg tttgagaggg 
cactcatata gatactctta 
tcatctgaat acattatctt 

<210> 12 
<211> 2110 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> miscjfeature 
<223> Incyte ID No: LG:247384.1:2000MAY19 

<400> 12 

ccaggagaag gaagccaaca ggatccgacc cggtgttttg tgacaaaggc aagaccccca 60 
ggtctactta .gagcaaagtt agtagaggag gcagctaggc gtggctctca ttccttccca 120 
cagaatggat tataagtcga gcctgatcca ggatgggaat cccatggaga acttggagaa 180 
gcagctgatc tgccctatct gcctggagat gtttaccaag ccagtggtca tcttgccgtg 240 
ccagcacaac ctgtgccgga agtgtgccaa tgatattttc caggcctcta acccgtattt 3 00 
gcccacaaga ggaggtacca ccatggcatc agggggccga ttccgctgcc catcctgtag 3 60 
acatgaagtg gttttggata gacatggggt atatggactt cagaggaacc tgctggtgga 420 • 
gaacatcatc gacatctaca aacaggagtg ctccagtcgg ccgctgcaga agggcagtca 480 
ccccatgtgc aaggagcacg aagatgagaa aatcaacatc tactgtctca cgtgtgaggt 540 
gcccacctgc tccatgtgca aggtgtttgg gatccacaag gcctgcgagg tggccccatt 600 
gcagagtgtc ttccagggac aaaagactga actgaataac tgtatctcca tgctggtggc 660 
ggggaatgac cgtgtgcaga ccatcatcac tcagctggag gattcccgtc gagtgaccaa 720 
ggagaacagt caccaggtaa aggaagagct gagccagaag tttgacacgt tgtatgccat 780 
cctggatgag aagaaaagtg agttgctgca gcggatcacg caggagcagg agaaaaagct 840 
tagcttcatc gaggccctca tccagcagta ccaggagcag ctggacaagt ccacaaagct 900 
ggtggaaact gccatccagt ccctggacga gcctggggga gccaccttcc tcttgactgc 960 
caagcaactc atcaaaagca ttgtggaagc ttccaagggc tgccagctgg ggaagacaga 1020 
gcagggcttt gagaacatgg acttctttac tttggattta gagcacatag cagacgccct 1080 
gagagccatt gactttggga cagatgagga agaggaagaa ttcattgaag aagaagatca 1140 
ggaagaggaa gagtccacag aagggaagga agaaggacac cagtaaggag ctggatgaat 1200 
gagaggcccc cagatgcaga gagactggag agggtgggga ggggcccagc ggccttggtg 1260 
acaggcccag ggtgggaggg gtcggggccc ctggaggggc aatggggagg tgatgtcttc 1320 
tctctgctca gagagcaggg actagggtag gaccctcacc gctgcgtcca gcagacactg 1380 
aaccagaatt ggaaacgtgc ttgaaacaat cacacaggac acttttctac attggtgcaa 1440 
aatggaatat tttgtacatt tttaaaatgt gatttttgta tatacttgta tatgtatgcc 1500 
aatttggtgc tttttgtaaa ggaacttttg tataataatg cctggtcatt gggtgacctg 1560 
cgattgtcag aaagagggga aggaagccag gttgatacag ctgcccactt cctttcctga 1620 
gcaggaggat ggggtagcac tcacagggac gatgtgctgt atttcagtgt ctatcccaga 1680 
catacggggt ggtaactgag tttgtgttat atgttgtttt aataaatgca caatgctctc 1740 
ttcctgttct tcaaaggagc cggggtttca ttcagccttt ttttcctgga gatgagggtt 1800 
gagtgtgaat gaacaggacc cctggtagga ggcaatggca gggctaggct taggtcccag 1860 
taaaggagtt ctcgacacca ccatttccca atgtggactc catggaaagc cagccctgag 1920 
ctggtccttc aagaacaggt tcaatgtgtt gttgctctgg ttctccagaa aacagagcct 1980 
gaggcaaaat ttaaatgctt tagttgcagg ttatagggac ttccccgtgc tcactgaagg 2040 
ctcactgaag gctcactgaa aatcatcaag aagaggcaga ttaaggctgg gcaggtgcag 2100 
tggttcatgc 2110 

<210> 13 

<211> 2375 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:403872 . 1 :2000MAY19 



aggtggcagg aattaaactt 
tggccataga cttttcccaa 
acaagaaggt gatgttagaa 
aaccagatgt gatctcctta 
tgaacagagg cctgtgccca 
accaggatat ttatgaagaa 
atgaccttga atgttcaaca 
agcttgtaaa ccagaagaca 
ttgaaaaaag agatcactct 
atataaaaca gatttt 



tgtaaagcca tgtccctggg 120 
gatgaatggg agtggctgaa 180 
aactacagga acctagtttc 240 
ctggagcaag agaaagaccc 300 
gacttggagt gtgtgtgggt 360 
aaattacccc cggcaatcat 420 
ttagggaaaa actggaaatg 480 
cattttaggc aagagaccat 540 
aacaaatctg ggacagtttt 600 

63 6 
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<220> 

<221> unsure 
<222> 1233 

<223> a, t, c, g, or other 
<400> 13 

gcagcgccag gaggaggcag cggaggaagc agagcgcggg atgggcgccc agcggcatct 60 
gtgatcccgc gcacctccgc cccacgggcg cgcgcacaaa cacggacaca cacatacaca 120 
cactcgcgca cacactcgca caaacacaca ctcgtacacg cccgcgccgc tcgctcgccg 180 
gcttgctctc ccacgcaagc ggaatgcagc agcgcctgga gagcgtgtct cggaccgccg 240 
cctgaatgta cctcgctccc gggagccgga cggcccagta gggcgcactg gaggacgctc 300 
cgctgcggga gcctggacag tttttgacgg tgcagtcttg ctatatggtg tgagaaatgg 360 
ctgtaggaaa caacactcaa cgaagttatt ccatcatccc gtgttttata tttgttgagc 420 
ttgtcatcat ggctgggaca gtgctgcttg cctactactt cgaatgcact gacacttttc 480 
aggtgcatat ccaaggattc ttctgtcagg acggagactt aatgaagcct tacccaggga 540 
cagaggaaga aagcttcatc acccctctgg tgctctattg tgtgctggct gccaccccaa 600 
ctgctattat ttttattggt gagatatcca tgtatttcat aaaatcaaca agagaatcce 660 
tgattgctca ggagaaaaca attctgaccg gagaatgctg ttacctgaac cccttacttc 720 
gaaggatcat aagattcaca ggggtgtttg catttggact ttttgctact gacatttttg 780 
taaacgccgg acaagtggtc actgggcact taacgccata cttcctgact gtgtgcaagc 840 
caaactacac cagtgcagac tgccaagcgc accaccagtt tataaacaat gggaacattt 900 
gtactgggga cctggaagtg atagaaaagg ctcggagatc ctttccctcc aaacacgctg 960 
ctctgagcat ttactccgcc ttatatgcca cgatgtatat tacaagcaca atcaagacga 1020 
agagcagtcg actggccaag ccggtgctgt gcctcggaac tctctgcaca gccttcctga 1080 
caggcctcaa ccgggtctct gagtatcgga accactgctc ggacgtgatt gctggtttca 1140 
tcctgggcac tgcagtggcc ctgtttctgg gaatgtgtgt ggttcataac tttaaaggaa 1200 
cgcaaggatc tccttccaaa cccaagcctg agnatccccg tggagtaccc ctaatggctt 1260 
tcccaaggat agaaagccct ctggaaacct taagtgcaca gaatcactct gcgtccatga 1320 
ccgaagttac ctgagacgac tgatgtgtca caagctgttt tttaaaatca tcttccaatt 1380 
ctatacttca aaacacacag ttgctcaatg tcaaactgtg atgacaaata ttacgtttat 1440 
ctagttagaa gctaatgttt tgtacatttt ttgtatgagg aagtgatgta gcttgccctg 1500 
attttttttt tttttttttg gtcagcttta atatatttat gccagaattt taaaaccaac 1560 
aaaattttct tgttcaagcg tgcattgaag aaccacattt attcaatggt tgacgttgtt 1620 
ttgtgatatt tgtacacaaa ttttcttttc tcagttttat aaacacagaa gtaaatataa 1680 
caattcactt taaactttta ttaccacagt tgctgcctcc tccagaattt ttgaatttta 1740 
ataaaaggca aacttttgag ctgcaggaag gacaatgttg gttaataata aatctcaaag 1800 
tcaattgtag aaaaaaaatt gtcttcaaaa agaatgttgc actctgatct cttaacaaat 1860 
tgttacgttc aaagtttaaa gtgatatatt aacaaagtca cctagttata caaacaattg 1920 
tcagagaatt ctggatttgg agggtattgg ggttatatga ttctttctta gataatggcc 1980 
tctactaaat aactcaagat ctttctggaa tgtcttctgg caggcaggtg ccactgtcag 2040 
cttttctcca aaaagcagcc aacatcagcc tcccctgtca actcaacagt tttgtatctc 2100 
atattatatg gactttatat gaaaatgaat attttacagt ttgcacagta ttattttaca 2160 
gaaaaggaat cagagaatct acaacatagg gccccagaac aacagtttca ctttgtggct 2220 
tttaattatt ctagaatttt aactgcatct catttttcta gcatggtgag aactaatatg 2280 
taactccttt gattgaagga gctcttttgt ccgtacctat cagaatgttt tcttgacact 2340 
tccatgttgg ctcttctcag ctttttttgt acata 2375 

<210> 14 

<211> 537 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LG: 1135213 . 1 :2000MAY19 
<400> 14 

ggacccgcga gcggagcggc gcgtgggtcg gttgcggtcg gccccggcag gatgggaagg 60 
ccattgtgac tatgtggtga ttacagttgt cttactactg agtttcctac tgaaatcatg 120 
gaggagaaac agcagattat attggctaat caagatggtg gaacagtggc aggagcagca 180 
cctaccttct ttgtcatctt aaagcagcca ggaaatggca aaactgatca aggaattttg 240 
gttactaatc aggatgcctg tgctttggct agtagtgtgt catcaccagt aaaatctaaa 3 00 
gggaagattt gccttccagc tgattgtact gtgggtggaa tcactgttac cctcgataac 3 60 
aatagtatgt ggaatgagtt ctatcatcga agcacagaga tgattctgac caagcaagga 420 
agacgcatgt ttccttactg tcgttattgg ataacaggtt tagattcaaa tttgaagtat 480 
attcttgtca tggatatatc tcctgtggat aaccatcgtt ataagtggaa tggtcgt 537 



8/69 



WO 01/62922 



PCT/US01/05896 



<210> 15 

<211> 1433 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:474284 . 2 :2000MAY19 
<400> 15 

ggcctgcccc ggccccctgc ccgcggcgcc atggcggaga attggaagaa ctgcttcgag 60 
gaggagctca tctgccctat ctgcctgcac gttttcgtgg agccagtgca gctgccgtgc 120 
aaacacaact tctgccgggg ctgcatcggc gaggcgtggg ccaaggacag cggcctcgta 180 
cgctgcccag agtgcaacca ggcctacaac cagaagccgg gcctggagaa gaacctgaag 240 
ctcaccaaca tcgtggagaa gttcaatgcc ctgcacgtgg agaagccgcc ggcggcgctg 3 00 
cactgcgtgt tctgccgccg cggccccccc gctgcccgcg cagaaggtct gcctgcgctg 360 
cgaggcgccc tgctgccagt cccacgtgca gacgcacctg cagcagccct ccaccgcccg 420 
cgggcacctc ctggtggagg cggacgacgt gcgggcctgg agctgcccgc agcacaacgc 480 
ctaccgcctc taccactgcg aggccgagca ggtggccgtg tgccagtact gctgctacta 540 
cagcggcgcg catcagggac actcggtgtg cgacgtggag atccgaagga atgaaatccg 600 
gaagatgctc atgaagcagc aggaccggct ggaggagcga gagcaggaca ttgaggacca 660 
gctgtacaaa ctcgagtcag acaagcgcct ggtggaggag aaagtgaacc aactgaagga 720 
ggaagttcgg ctgcagtacg agaagctgca ccagctgctg gacgaggacc tgcggcagac 780 
agtggaggtc ctagacaagg cccaggccaa gttctgcagc gagaacgcag cgcaggcgct 840 
gcacctcggg gagcgcatgc aggaggccaa gaagctgctg ggctccctgc agctgctctt 900 
tgataagacg gaggatgtca gcttcatgaa gaacaccaag tctgtgaaaa tcctgatgga 960 
cagcagatgc cccgtccact ggccccagga cccagacctg cacgagcagc agcctttccc 1020 
ccactaagat cggccacctg aactccaagc tcttcctgaa cgaagtggcc aagaaggaga 1080 
agcagctgcg gaaaatgcta gaaggcccct tcagcacgcc ggtgcccttc ctgcagagtg 1140 
tccccctgta cccttgcggc gtgagcagct ctggggcgga aaagcgcaag cactcaacgg 1200 
ccttcccaga ggccagtttc ctagagacgt cgtcgggccc tgtgggcggc cagtacgggg 1260 
cggcgggcac agccagcggt gagggccagt ctgggcagcc cctggggccc tgcagctcca 1320 
cgcagcactt ggtggccctg ccgggcggcg cccaaccagt gcactcaagc cccgtgttcc 1380 
ccccatcgca gtatcccaat ggctccgcgc ccagcagccc atgctccccc agt 1433 

<210> 16 

<211> 654 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 342147 . 1 : 2000MAY19 
<400> 16 

cgaattgggc ccctagatgt ttgctcgagc ggcggccgca gtgtgctgga aagggacaaa 60 
gacttgtaac tggagaaata gtttgtaagg gagatttttc ttcctctacc cacacctttc 120 
aaggcaggga gcaatgaaag acaaacctgt actgttcacc atatttcatt gafctgcaata 180 
ggagtattga ggtcactttt atattgtcct ggatagtatg tagttacgcg gtttgtaaag 240 
agaggaatgg gatggggggc tgtgagaagg aagaattagt ggtcgatttc ggaggagcag 300 
gatggagatc cctgtgcctg tgcagccgtc ttggctgcgc cgcgcctcgg ccccgttgcc 3 60 
cggactttcg gcgcccggac gcctctttga ccagcgcttc ggcgaggggc tgctggaggc 420 
cgagctggct gcgctctgcc ccaccacgct cgccccctac tacctgcgcg cacccagcgt 480 
ggcgctgccc gtcgcccagg tgccgacgga ccccggcccc ttttcggtgc tgctagacgt 540 
gaagcacttc tcgccggagg acattgctgt caaggtggtg ggcgaacacg tggaggtgca 600 
cgcgcgccac gaggagcgcc cggatgagca cggattcgtc gcgcgcgagt tcca 654 

<210> 17 

<211> 1651 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:1097300.1:2000MAY19 
<400> 17 
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gccgccgagg aggaggccct 
gttcatcgga aacctgcccc 
gtatgggaag gtgctggaat 
caagacggca gctgaggatg 
catcaacgcg gaagccagca 
catcagcccc acctgcacca 
catcgaatgt gacatcgcga 
agcggaggcc atcaggggcc 
ggtagtagcg ccgagtgggg 
gctgggtggt ctcttccatt 
aagcataatt tacccctggg 
ctcaccgact gcatagaatc 
gattcgagtg ctgttggaag 
gaggataaag gtccgtggct 
agggtttaag tgcttaagat 
aagaaaaaag acatctagag 
gcgggagtat ctttttgttg 
atgccatgga gtgttctgct 
tatatcagac tcaatgggaa 
tagagcatat catactattt 
aagaggttgt cagaagactc 
attgaaatta acagattaga 
gtttcaagca ctcagtagtc 
tttccatcat catagaacat 
tatcgtcttg gacctcatgt 
gccctgagtg tcattaaagg 
aaaacaaaat gagaaaaata 
gtttgcatag tggagatggg 

<210> 18 
<211> 1870 
<212> DMA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 
<223> Incyte ID No: LG : 444850 . 9 : 2000MAY19 

<220> 

<221> unsure 

<222> 1865, 1867 

<223> a, t, c, g, or other 

<400> 18 

ggctctgaag ccattacaaa ggttgcttaa cttctaatta tttgatcact gaggaaaatc 60 
cagaaagcta cacaacactg aaggggtgaa ataaaagtcc agcgatccag cgaaagaaaa 120 
gagaagtgac agaaacaact ttacctggac tgaagataaa agcacagaca agagaacaat 180 
gccctggaca tggctccaga gatccacatg acaggcccaa tgtgcctcat tgagaacact 240 
aatggggaac tggtggcgaa tccagaagct ctgaaaatcc tgtctgccat tacacagcct 3 00 
gtggtggtgg tggcaattgt gggcctctac cgcacaggaa aatcctacct gatgaacaag 3 60 
ctagctggga agaataaggg cttctctctg ggctccacag tgaaatctca caccaaagga 420 
atctggatgt ggtgtgtgcc tcaccccaaa aagccagaac acaccttagt cctgcttgac 480 
actgagggcc tgggagatgt aaagaagggt gacaaccaga atgactcctg gatcttcacc 540 
ctggccgtcc tcctgagcag cactctcgtg tacaatagca tgggaaccat caaccagcag 600 
gctatggacc aactgtacta tgtgacagag ctgacacatc gaatccgatc aaaatcctca 660 
cctgatgaga atgagaatga ggattcagct gactttgtga gcttcttccc agattttgtg 720 
tggacactga gagatttctc cctggacttg gaagcagatg gacaacccct cacaccagat 780 
gagtacctgg agtattccct gaagctaacg caaggtacca gtcaaaaaga taaaaatttt 840 
aatctgcccc aactctgtat ctggaagttc ttcccaaaga aaaaatgttt tgtcttcgat 9 00 
ctgcccattc accgcaggaa gcttgcccag cttgagaaac tacaagatga agagctggac 960 
cctgaatttg tgcaacaagt agcagacttc tgttcctaca tctttagcaa ttccaaaact 1020 
aaaactcttt caggaggcat caaggtcaat gggcctcgtc tagagagcct agtgctgacc 1080 
tatatcaatg ctatcagcag aggggatctg ccctgcatgg agaacgcagt cctggccttg 1140 
gcccagatag agaactcagc cgcagtgcaa aaggctattg cccactatga ccagcagatg 1200 
ggccagaagg tgcagctgcc cgcagaaacc ctccaggagc tgctggacct gcacagggtt 1260 
agtgagaggg aggccactga agtctatatg aagaactctt tcaaggatgt ggaccatctg 1320 
tttcaaaaga aattagcggc ccagctagac aaaaagcggg atgacttttg taaacagaat 1380 



gctggtttct gtgcgggctc 
gggaggctac agagcaggag 
gtgacatcat taagaattac 
ccatacgcaa cctgcaccac 
agaacaagag caaagcccca 
accaagagct tcgagccaag 
aagactatgc cttcgcacac 
tcgacaacac agagtttcaa 
tctagctcaa aacaggcaag 
ctgttttagc tggaaafcaat 
gaagcaaaca cttcctcttt 
acaaactgtg ggttacttta 
tgtaatctgc ttggggaaac 
tacctgcttc tttggtgatg 
ttatattctt tactgctttg 
gaagccctat attataaatc 
atcaaagttg tgcagtctct 
ccacctgttc atttcaccct 
tgatggtccc gcttctgaag 
ataccgataa taaaggtaca 
tagaactgtg ctaatatggt 
taaaatttaa aattcagttt 
atgaggcctg tggctaccgt 
tcttttggat agcactgttc 
tcatagccac ttttgtggtt 
ggttgttaag aagtactttt 
atgggagaag aggaaaagtt 
tagaggagca c 



ttgtccagga tggtgaagct 60 
attcgctcac tcttcgagca 120 
ggctttgtgc acatagaaga 180 
cacaagcctc atggggtgaa 240 
accaagttac acgtgggcaa 300 
tttgaggagc acggtccggc 3 60 
atggagcggg cagaggacgc 420 
ggtgaactgc tctgggcctg 480 
aacacaagac tatagaactt 540 
agattatgtt taccgctctt 600 
tcaggtttgc taagatgttg 660 
ccctgcggga ttcttgcatt 720 
gagtacctca tgagagaagg 780 
atcaggaagc cttatatttg 840 
ggtggatact ggtgggaaag 900 
tgggtggcaa gtctggatct 960 
tcaagcagag tcaaaaaaac 1020 
cagaaaagga aatttctaaa 1080 
aaatttcagt acaagcatcg 1140 
tatgttgtca ttaataccac 1200 
aaccacatgc ggcttagtaa 1260 
ttcaagtgta taccagacac 1320 
attaatagag acacagaaca 1380 
tacaagtgtt ttgttaacag 1440 
cctaagtcaa cacctttttt 1500 
gggtcttcta ttaaaactaa 1560 
gaccagagaa gggtaagaaa 1620 

1651 
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caagaagcat catcagatcg ttgctcagct ttacttcagg tcattttcag tcctctagaa 1440 
gaagaagtga aggcgggaat ttattcgaaa ccagggggct attgtctctt tattcagaag 1500 
ctacaagacc tggagaaaaa gtactatgag gaaccaagga aggggataca ggctgaagag 1560 
attctgcaga catacttgaa atccaaggag tctgtgaccg atgcaattct acagacagac 1620 
cagattctca cagaaaagga aaaggagatt gaagtggaat gtgtaaaagc tgaatctgca 1680 
caggcttcag caaaaatggt ggaggaaatg caaataaagt atcagcagat gatggaagag 1740 
aaagagaaga gttatcaaga acatgtgaaa caattgactg agaagatgga gagggagagg 1800 
gcccagttgc tggaagagca agagaagacc ctcactagta aacttcaggt atccaaatgc 1860 
aaaananaaa 1870 

<210> 19 

<211> 628 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LG:402231 . 6 .-2000MAY19 
<220> 

<221> unsure 

<222> 580, 592 

<223> a, t, c, g, or other 



gcgctctctt ttccaggatc atccagcagc tcgtcaacgg catcatcacg cccgccacca 60 
tccccagcct gggcccctgg ggagtcctgc actcaaaccc tatggactac gcctgggggg 120 
ccaacggcct ggatgccatc atcacacagc tcctcaatca gtttgaaaac acaggccccc 180 
caccggcaga taaagagaaa atccaggccc tccccaccgt ccccgtcact gaggagcacg 240 
taggctccgg gctcgagtgc cctgtgtgca aggacgacta cgcgctgggt gagcgtgtgc 300 
ggcagttgcc ctgcaaccac ctgttccaca caacatacga gcaggcctgg ctggagcagc 360 
acgacagctg ccccgtctgc cgaaaaagcc tcacgggaca gaacacggcc acgaaccccc 420 
ctggcctcac tggggtgagc ttctcctcct cgtcgtcatc gtcctcctcc agctcgccca 480 
gcaacgagaa cgccacaagc aactcgtgag cccacgtcgg ccgtcgggaa agcacggggc 540 
ctttcccacc caccctcagc cagcgccaca cggcacccan agactgggtg cnccggcggc 600. 
gccacgcttg gctggtcagc gctgcagg 628 

<210> 20 

<211> 798 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1076157 . 1 :2000MAY19 
<220> 

<221> unsure 
<222> 777 

<223> a, t, c, g, or other 



aaaaaaaaat tgctttatgg aagaaagtaa gtatagacag agagaaaggg atctgatgac 60 
caaagcaggg aataaatgtt tggagtccac ggcatcctga gaacttcttg ggaatagagt 120 
ctaggccccc aatgctgtca ctctcaccca tcctcctcta cacatgtgag atgtttcagg 180 
acccagtggc ttttaaggat gtggctgtga acttcaccca ggaggagtgg gctttgctgg 240 
atatttcgca gaggaaactc tacagggaag tgatgctgga aactttcagg aacctgacct 300 
ctatagggaa aaagtggaaa gaccagaaca ttgaatatga gtaccaaaac cccaggagaa 360 
acttcaggag tctcatagaa gggaatgtca atgaaattaa agaagacagt cattgtggag 420 
aaacttttac ccaggttcca gatgacaggc tgaacttcca ggagaagaaa gcttctcctg 480 
aagcaaaatc atgtgataac tttgtatgtg gagaagttgg cataggtaac tcatctttta 540 
atatgaacat cagaggtgac attgggcaca aggcatacga gtatcaggac tatgcaccaa 600 
agccatataa gtgtcaacaa cctaagaaag ccttcagata tcacccctcc tttagaacac 660 
aagaaaggaa tcacaccgga gagaaaccct atgcttgtaa agaatgtgga aaaaccttta 720 
tttcccattc aggcattcga agacgcatgg taatgcacag tggggatgga cccttanatg 780 



<400> 



<400> 



20 



taagttttgt gggaaagc 



798 
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<210> 21 

<211> 410 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1083142 . 1 : 2000MAY19 



<220> 

<221> unsure 
<222> 51 

<223> a, t, c, g, or other 



<400> 21 

ttccgttttc 

tattggaacc 

ttcaggaatt 

acctggaccc 

tggtatcact 

gaaaagagcc 

gtgagataat 



gcgtggttct 
ccaccaagta 
agtgacattc 
taatcagagg 
gggaggacat 
ctggatgatt 
cagctacata 



tttgcaagct 
ggactgatca 
agggatgtgg 
gacttataca 
tccatttcta 
ttgagggaag 
gaagtaccca 



ctggattctc 
ggtcttacaa 
ccatagactt 
gggatgtgat 
aaccagttgt 
aaacacagtt 
cttatgaaac 



tggagtttga 
ttctaaaacc 
ctctcggcag 
gttggagaac 
ggttgattta 
cacagatttg 
agatatatcc 



ntgtttccag 60 
atgacctgtt 120 
gagtgggaat 180 
tatagaaacc 240 
ctggagcgag 300 
gatttacagt 360 
410 



<210> 22 

<211> 819 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LG: 1083264 . 1 : 2000MAY19 



<400> 22 

cggaagccga 

cggttctcct 

cgagccagtt 

cccctccctg 

ttttgaaaga 

aaattttaag 

gaggattcac 

gaagtcagga 

ctctcaggaa 

gttggagaat 

ctccatgtta 

cgatccagac 

ggaagtaatg 

aaattatgag 



ttgcagggag 
gaccccgagt 
tccagcccac 
tgaatgtgtg 
gaaatgaaga 
aaagggagaa 
ttccaaagag 
atggccctta 
gagtggaaat 
tataggaacc 
gagcaaaaga 
ggcagggagt 
caggaaacaa 
tgatatacag 



aaactgtttt 
gtggggggtg 
agaaaatgag 
gagaaaaaga 
atgagagacc 
taaagtgaaa 
acatattatg 
ctcagggacc 
gcctggaccc 
tggtctccct 
gagatccctg 
gcatacaaaa 
gaccttgtaa 
ctaatttcaa 



cgcagcagtg 
acttcagtct 
ctcttccgga 
gatgggaacg 
cattaacaga 
aaaatctcag 
caaggaagca 
cttgaaattc 
tgcgcagagg 
gggaatctgt 
gactctgcag 
ggtgtgaaca 
aaatcaaatt 
gactgaaag 



cgcctccctt 
cctgacatcc 
agtgggcatc 
aggcagagga 
aggcaaagta 
aaggaatcca 
acttggaaga 
atggatgtgg 
actttataca 
cttcctgacc 
agtgaagaga 
cagagaggag 
ggattcaact 



ttccagccac 60 
agtgttctct 120 
ttattccaat 180 
aatagagaaa 240 
gaaggttcac 300 
ctcaacagac 360 
ggaaagaaaa 420 
ccatagagtt 480 
gggacgtgat 540 
tgagtgttac 600 
aaatagcaaa 660 
ctctaaattg 720 
tttacagtat 780 
819 



<210> 23 

<211> 2516 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID NO: LG : 350793 . 2 : 2000MAY19 
<220> 

<221> unsure 

<222> 85, 118, 146 

<223> a, t, c, g, or other 

<400> 23 

agtgtttctc atatctgggg tcactttaga caactgtgtt gaagttggac ggattgccaa 60 
cacctacaat ctaaccgaag tgganaaata cgttaacagt ttcgtcttga agaatttncc 120 
tgcattgctg agcacagggg agttcntgga aactcccttt tgagcgtctt gccttcgtgc 180 
tttccagtaa tagccttaag cactgtactg aacttgagct ctttaaggct acctgtcgtt 240 
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ggcttcgcct ggaagagcct cggatggact ttgctgcaaa attaatgaag aacatacgat 300 

ttccactgat gacaccacag gagctcatta attacgtgca aacggtggat ttcatgagaa 360 

ctgacaatac ttgtgtgaat ttgcttttgg aagccagcaa ttaccaaatg atgccatata 420 

tgcagccagt tatgcagtca gacaggactg ccattaggtc tgacaccact cacttggtta 480 

cactaggagg agtgctgagg cagcagctgg ttgtcagtaa ggaattgcgc atgtatgatg 540 

aaaaggccca tgagtggaaa tcgttagccc ccatggatgc cccaaggtac cagcatggca 600 

tcgccgtcat tggaaatttt ctctatgtgg ttggcggaca gagtaattat gatacaaaag 660 

gaaaaacggc agttgataca gtcttcagat ttgatcctcg atacaataaa tggatgcaag 720 

ttgcatcttt aaatgaaaag cgcaccttct tccacctaag tgccctcaaa ggatatctgt 780 

atgcagttgg tgggcgaaat gcagcaggtg aactgcccac agtagaatgt tacaatccaa 840 

gaacaaatga atggacctat gttgccaaaa tgagtgagcc ccactatggc catgctggaa 900 

ctgtgtatgg aggagtgatg tatatttcag gaggaattac tcatgatact ttccaaaagg 960 

agctcatgtg ctttgaccct gatactgaca aatggatcca gaaggcgcca atgaccactg 1020 

tcagaggtct gcattgcatg tgtacagtgg gagaaaggct ctatgtcatt ggtggcaatc 1080 

acttcagagg aacaagtgat tatgatgatg tcctaagctg tgaatactat tcacctatcc 1140 

ttgaccagtg gaccccaatt gctgccatgt taagagggca gagtgatgtt ggggtcgctg 1200 

tcttcgaaaa taaaatctat gtggttgggg ggtattcttg gaataatcgt tgtatggtag 12 60 

agatagtgca gaaatatgat ccagataaag atgaatggca taaggttttt gatctgccag 1320 

aatcccttgg tggcattcgt gcttgcacac tcacagtttt tccaccagaa gaaaccacac 1380 

catcaccttc tagagagtcc cctctttctg caccttaaga tcatctctac aactaagatg 1440 

ctgtagttct atctttgcaa tgtgtcataa attctcttct ttttccccct taagtagtat 1500 

atatgttagg attaccctct ggtaattgat acagatattg gaaaaaagac aacattgatg 1560 

ttatttgtgc tctttgtttg gcctagaatg tttataaagt ggtaacacaa ccattctgga 1620 

aatgtatccc atagaagctg atgtttaaca tatgaaaaaa aaagtattgt ctataaaatg 1-680 

tttcttcagt actttttaaa tgctgtgtat tgggtgtaag gtatttgtca tcttacatta 1740 

gtaaacccaa taagccaagt tgaaggtgga ttatagtaaa tgtacaactg tgctcactag 1800 

gcttcaagta aaaagttttc ctttcatctt tgactgtaag atgtcaaagg gaggcagcct 1860 

gcttgaacag gaaacaatac acaaaaggtt gccaactcgc atgagctacc tccctctttt 192 0 

cataaagtat ttttgacata tctgtcaacc cacttgactg tgtgggtgca ttgagaacac 1980 

aaagtttcct agacacacag gagaagtagc ttaaattcac taatattaat ttaaaaagca 2040 

gcatgaaccc tctacttata aacaagggtt tggtgttttt aaagtgtgta tacatacata 2100 

cacatacaca catgcacata tgtcaaatat aattttttta aaaattgagt ggcacatcaa 2160 

agaaatgtga aattaaaaag aattcttcca aaaagcagct tccattaaaa tgggaattca 2220 

gtatgcacat actgaatgca tatatgtaga accatacaga atttaggtgg ataagggcta 228 0 

gaaattttga gcaacaaaat ttgtcacttg accagatttt atcttcaaaa actgtattct 2340 

actccttctc ctttgctgtt gaggtaactt gcatattata tgtattctgt atactcagtt 2400 

cataaggtta tttagcacaa agtatagcag cttcacctgg agagctgctt ttgctcagta 2460 

aattcaactt ccatgtttta tctttttttg ttccaataaa aacatttaat gtcaaa 2516 

<210> 24 

<211> 1660 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID NO: LG: 408751 . 3 : 2000MAY19 
<400> 24 

tagggaccca ggatggcaga tccgggaccg ggctgggctg gcttggaaca tgcttgccaa 60 
ctcagccagc gtgaggatcc tcatcaaggg aggcaaggtg gtgaacgatg actgcaccca 120 
cgaggctgac gtctacatcg agaatggcat catccagcag gtgggccgcg agctcatgat 180 
ccctggcggg gccaaggtga ttgatgccac aggaaaactg gtgatccctg gtggcatcga 240 
caccagcacc cacttccacc agaccttcat gaatgccacg tgcgtggacg acttctacca 3 00 
tgggaccaag gcagcactcg tcggaggcac caccatgatc atcggccacg tcctgcccga 3 60 
caaggagacc tcccttgtgg acgcttatga gaagtgccga ggtctggccg accccaaggt 420 
ctgctgtgat tacgccctcc acgtggggat cacctggtgg gcacccaagg tgaaagcaga 480 
aatggagaca ctggtgaggg agaagggtgt caactcgttc cagatgttca tgacctacaa 540 
ggacctgtac atgcttcgag acagtgagct gtaccaagtg ttgcacgctt gcaaggacat 600 
tggggcaatc gcccgcgtcc atgctgaaaa tggggagctt gtggccgagg gtgctaagga 660 
ggcactggat ttggggatca caggcccaga aggaatcgag atcagccgtc cagaggagct 720 
ggaagctgaa gccactcatc gtgttatcac cagggatggg ggaaaccatg acgccgcctc 780 
ctggtgcagt gcacaccatc tctatccctg tcagccctca ctgggtcatg ggccttgggc 840 
agatgtcaaa gagcccagca gcagcggtgg tggccagctg ggcagagcat ccttgcttgg 900 
gctaggaaag ctttaccttc tctgagtgcc tccgcctgag agatgtgtga cccgtggcac 9 60 
cagggaacca cgtcttggag tggfcccactg taggccatgc gcttcatcca cccccagtcc 1020 
ctacataggc cctacccttg cccgggagct tctagataga aatcagaaag agattcaagg 1080 
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agccaaatga gcggtcagcc 
cttcgtcacc agccccactg 
gcagctctca aactggtctt 
tctctaggtt tgtggggtta 
cgtgtgtgta gtgctaggag 
cttttctgat actgacccag 
gctgaggcgc aggcacagcc 
acttccccac tcccctgtaa 
atggtcctgt gctttggcca 
ctggtgcagg tttgtattaa 

<210> 25 
<211> 2762 
<212> DNA 

<213> Homo sapiens . 
<220> 

<221> misc_ feature 
<223> Incyte ID No: LI : 33 6120 . 1 : 2000MAY01 

<400> 25 

gaagaccatg agggtgcaca gctggaaaac tctggtgtct cagcttaggg cctcctccgg 60 
gaagagctaa ctgctcccag gtgaagccgg tgcccgcggg cggtccgtac accccgcagc 120 
cggctcgcac cgctcgagag cctcggccgc tgtgtcttcc acgtctgcag ctcagccagg 180 
gcgcgcaggg cgagtggggt ccactggcgg gtaaagggga ccaggacggc gaggatggac 240 
gcacagacct ggcccgtggg ctttcgctgc ctcctccttc tggccctggt tgggtccgcc 3 00 
cgcagcgagg gcgtgcagac ctgcgaagaa gttcggaaac ttttccagtg gcggctgctg 3 60 
ggagctgtca gggggctgcc ggattcgccg cgggcaggac ctgatcttca ggtttgcata 42 0 
tccaaaaagc ctacatgttg caccaggaag atggaggaga gatatcagat tgcggctcgc 480 
caggatatgc agcagtttct tcaaacggtc cagctctaca ttaaagtttc taatatctcg 540 
aaatgcggct gcttttcaag aaacccttga aactctcatc aaacaagcag aaaattacac 600 
cagtatactt ttttgcagta cctacaggaa catggccttg gaggctgctg cttcggttca 660 
ggagttcttc actgatgtgg ggctgtattt atttggtgcg gatgttaatc ctgaagaatt 720 
tgtaaacaga ttttttgaca gtctttttcc tctggtctac aaccacctca ttaaccctgg 780 
gtgtgactga cagttccctg ggaatactca gaatgcatcc ggatggctcg ccgggatgtg 840 
agtccatttt tgtaaattat tccccaaagg agtaatgggg acagatgggg gaggtccctg 900 
ctgcccagcc gcacttttct gcaggcactc aatctgggca ttgaagtcat caacaccaca 960 
gactatctgc acttctccaa agagtgcagc agagccctcc tgaagatgca atactgcccg 1020 
cactgccaag gcctggcgct cactaagcct tgtatgggat actgcctcaa tgtcatgcga 1080 
ggctgcctgg cgcacatggc ggagcttaat ccacactggc atgcatatat ccggtcgttg 1140 
gaagaactct cggatgcaat gcatggaaca tacgacattg gacacgtgct gctgaacttt 1200 
cacttgcttg ttaatgatgc tgtgttacag gctcacctca atggacaaaa attattggaa 1260 
caggtaaata ggatttgtgg ccgccctgta agaacaccca cacaaagccc ccgttgttct 1320 
tttgatcaga gcaaagagaa gcatggaatg aagaccacca caaggaacag tgaagagacg 1380 
cttgccaaca gaagaaaaga atttatcaac agcctttcga ctgtacaggt cattctatgg 1440 
aggtctagct gatcagcttt gtgctaatga attagctgct gcagatggac ttccctgctg 1500 
gaatggagaa gatatagtaa aaagttatac tcaagcgtgt ggttggaaat gggatcaaag 1560 
cccagtctgg aaatcctgaa gtcaaagtca aaggaattga tcctgtgata aatcagatta 1620 
ttgataaact gaagcatgtt gttcagttgt tacagggtag atcacccaaa gctgacaagt 1680 
gggaacttct tcagctgggc agtggtggag gcatggttga acaagtcagt ggggactgtg 1740 
atgatgaaga tggttgcggg ggatcaggaa gtggagaagt caagaggaca ctgaagatca 1800 
cagactggat gccagatgat atgaacttca gtgatgtaaa gcaaatccat caaacagaca 1860 
ctggcagtac tttagacaca acaggagcag gatgtgcagt ggcgactgaa tctatgacat 1920 
tcactctgat aagtgtggtg atgttacttc ccgggatttg gtaactgaac tcttctgtcc 1980 
tgacatacct tactgaagtc tcgatttctt ctctctctgc atatgcctgg aataagagat 2040 
cctttttcaa tgtaacaatt atatttatga aaagatatgt tacactaact tctcagaagc 2100 
caagctgaaa tattcataaa gtccctaaaa ctcaacgttt aaatgacaca ctttaaaaat 2160 
atgtcttttt tcaatctaac tgaaaacctt cttaacttct aatatattaa atctgaagat 2220 
gtgaagggca cagaagtgac tttgaataag aagaatttag tgtatctgta attttattat 2280 
caattcccaa gccccttcct ttctaaatta aaaatgtttt catttgaaag tgtatttgcc 2340 
agacaatgaa aacagtatgc agtatttctt aaagtattga aattagaata tcatgaaata 2400 
aatcaaaaca tacaatggca agtagtatgc atgcatattc aagagactct tccatttttg 2460 
caagctgtag aaggaaatgt ctgaatgtct ataagttatg gggtagattc ttgagaagca 2520 
tttccatata atttcactga agaaccttga taattttgac ccactgtaac ttagccactg 2580 
atgaacctta aagctgagta ttttattaac acctgatttg tattccatta tattcaaaat 2640 
gcatctttgg tattgtgcct ctgctcccat ctctctcttt gcctcataga tttagctatg 2700 
ttgggaagca catgcttgct ctaggaatat ctccaataaa gctgttaact atttggtgga 27 60 



cccaccatgc actccttgcc ccgtgcagag ctccagccag 1140 
gctcctggtt ggaacgaaag ggtctctggt tgcactgaat 1200 
gtacttgctg aataaatact gttgttcttg ccttagctgc 1260 
agttgccaga aaattgtgct actgtgtgtg cgtgtgcgtg 1320 
tccacagtag gtctctgtca agccgatgtc gtgatgaggg 1380 
aagccacaga accacaagga aacccaaacc ccctccagct 1440 
tggggtcgga tggagcctcc agcaccccag cacccaggtg 1500 
atgtcatggt gctaagactg tgtcaacccc aagacgacac 1560 
ccgtttgagg caaaaactaa acagcccgac acgttgtgtt 1620 
actgtagcta cttctcaaaa 1660 
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aa 2762 

<210> 26 
<211> 4328 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 234104 . 2 : 2000MAY01 
<400> 26 

tgcgcccgga gccggggccg agtcgctgcc gcagctgttg gggcgcccgg gccaggcgac 60 
gcggccgtcg cccgtgcccc tcccagaccg caccggccgc atggagcccc cggagggcgc 120 
cggcaccgga gagatcgtta aggaggctga ggtgccgcag gctgcgctgg gcgtcccagc 180 
ccaggggaca ggggacaatg gccacacgcc tgtggaggag gaggtcgggg gcatcccagt 240 
accagcaccg gggctcctgc aggtcacgga gaggaggcag cctctgagca gcgtctcctc 300 
tctggaggtc cacttcgacc tcctggacct cactgagctc accgacatgt cggaccagga 3 60 
gctggccgag gtctttgctg actcggacga cgagaacctc aacaccgagt ccccagcagg 420 
tctgcacccg ttgccccggg ccggctacct gcgctcccct tcctggacga ggaacaaggg 480 
ctgagcagag ccacgagaag cagcccctag gcgaccccga gcggcaggcc acagtcctgg 540 
acacgtttct cactgtggag aggccccagg aggactagac catctccacc tgccccagct 600 
cctgcaggga tggggtccga acacgatggc agatctgggc cagtgctgac cccagcagac 660 
acacttcacc cgcccacgag gctccagccg tcacctcctg acacacaccc tgggggcagc 720 
tctctgccag ccccgagacc ggccttgtct tgctgggcac gggtcttcgc ctcacttgtg 780 
agaccagccg gctttcctgg ggggacacac ggggcccccg gtatgcctct ggggagcccc 840 
agcacaagca cagcccagtg gccttacgtc cagctcgttc ctgggccccg agtcaggaag 900 
acagcgtcac ggagtcactg ccaggaacgt gctgaggaat ggagtggccc acggcggcct 960 
tggggtgaag gggacccagg gcctgtgaca gccactccag gaactcctgg gggtgctcca 1020 
acctccgcgt tttcctgtgc tgccaagctt cagaagccag atgcgggttt ggtagtggct 1080 
aatgggacaa tgtgctgtcc agcaaagcac acatggagaa gcggccccaa aattcccatc 1140 
cttgatttcc atcctgcccc ttcttctact ccacggagtg cgctgtctca ctagtggtcc 1200 
ccctccacaa ggctcagcct ctaagacctg cacctgcttc tcttggcccc tgcgtgacag 1260 
acaagtccat tccctcctta gctcagaaca ccaaatatca ccagactgcc taagagactt 1320 
gatgacacct cccggaatgc tctcggggtt ggggttcacc tctccttgtc ctgcacccac 13 80 
tgctaggcca cattctcgtt tctgctcaca tcccattgcc cggctacaag gcctgcccac 1440 
ggcccttaaa cttgctgggc aggtttggag ccccatggga ccccgtgggt ctctgtccag 1500 
gagcagcaga ggaaggttga caggccctgc tccctctgct ctgggggtgt ctgggagccc 1560 
cagctcacac cctcccaatg cttatatgct gaagctcaca gaatgggctt cttgcctgac 1620 
agcgaagtca aagaatgagt ttaatatcaa agtgtaagct tactttccat ccccaagcca 1680 
gactggcccc tgccccattt cccatgagca cacttctggg gaaggacaac aggctccctg 1740 
gccttcactc tcagcagagc tttgggagat gcccccaggc atgcccgtga gctccttctg 1800 
tgtacctgct cccacttcgt gagccacccg gctgcccctc cgcactgctg gcaaacccag 1860 
ttccctgcct cagcccaggt ctccttccct ggtttccagt cacacaagag cccagcagct 1920 
ttctctttca gtcccataag gggcagcctt ttgtccctgg ccactcttat ctttccccac 1980 
ttcattccac ccagtctccc tcccccgtcc ctgcccaaac gcgcgcccct ccgcccctcc 2040 
cttggcccca gcgcccagcc ctgctctccg cgctcggcca gagggagcca gtccggagac 2100 
ggccgcacct ggctggagag gctgggcggg cggatgggtg gaaactcgcg gacgcgggag 2160 
ccgatctgga ccggagcagc cgcgagcaga atggagtctc ctaacagcct ctcggtgctg 2220 
atgtgaaatt tgaccatctg attccagttt ttttcttttc cttttctttt ttgcatttcc 2280 
ttccctcgcc atccgtcgtg tagtgaattg ttcagtcttg ctccgtttca agagaggaga 2340 
tcatgattga gtgaagccac cccgtccgca gccaggaaaa gcacaaagaa gaaactgcaa 2400 
caatggccaa gctgacagaa tccatgacta acgtcctgga gggcgactcc atggatcagg 2460 
acgtcgaaag cccagtggcc attcaccagc caaagttgcc taagcaggcc agggatgacc 2520 
tgccaagaca catcagccga gatcggacca aaaggaaaat ccagaggtac gtgaggaaag 2580 
acggaaagtg caatgttcat cacggcaacg tgagggagac ctatcgctac ctgaccgata 2640 
tcttcaccac attagtggac ctgaagtgga gattcaacct attgattttt gtcatggttt 2700 
acacagtgac ctggctcttt tttggaatga tctggtggtt gatcgcatac atacggggag 2760 
acatggacca catagaggac ccctcctgga ctccttgtgt taccaacctc aacgggttcg 2820 
tctctgcttt tttattctca atagagacag aaaccaccat tggttatggc taccgggtca 2880 
tcacagataa atgcccggag ggaattattc ttctcttaat ccaatactgt gttggggtcc 2940 
attgtcaatg cttcactggt gggatagcct gtgtgtgaaa atctctcaaa cccaagaaaa 3000 
gggcagagac cctggtcttt tccacccatg cagtgatctc catgcgggat gggaaactgt 3060 
ggctgatagt tccgggtagg ggaccttagg aattcccaca ttgtggaggc ttccatcaga 3120 
gccaagttga tcaaatccaa acagaactcg gagggggagt tcatcccgtt gaaccagacg 3180 
gatatcaacg tagggtatta cacgggggat gaccgtctgt ttctggtgtc accgctgatc 3240 
attaggcatg aaattaacca acagagtcct ttctgggaga tctccaaagc ccagctgccc 3300 
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aaagaggaac tggaaattgt ggtcatccta gaaggaatgg tggaagccac agggatgaca 33 60 
tgccaagctc gaagctccta catcaccagt gagatcctgt ggggttaccg gttcacacct 3420 
gtcctgaccc tggaggacgg gttctacgaa gttgactaca acagcttcca tgagacctat 3480 
gagaccagca ccccatccct tagtgccaaa gagctggccg agttagccag cagggcagag 3540 
ctgcccctga gttggtctgt atccagcaaa ctcaaccaac atgcagaact ggagactgaa 3600 
gaggaagaaa agaacctcga agagcaaaca gaaagaaatg gtgatgtggc aaacctggag 3660 
aatgaatcca aagtttagtg ccctagctgg gcaaaccctt ctcttctccc cccaacacaa 3720 
tctttccttg tctctcattc tctttctttt tctgtctctc tggctttgtt ctttatttgt 3780 
ttatatttaa tttttacatg accagaaaac aaatcttcaa ggtgtaaaat atctacctgc 3840 
cctctctcag ttattcagat tgacaaggta gacatggatt tgatgaaagt gcaaagtgcc 3900 
ctcatttgtg gcccaagcct ggtctcctcc caaaatacta cacatccaac tcctggagat 3960 
ttcagttact tacctgcatg tgttgtacaa taccagatca ctcaaaaagg tgtgtcaaag 4020 
attttacctg ggatatgaca agcaaggttt ctggtgccta tttattcatt cagtgagaca 4080 
cagagtggag ccctcagttt tatggatccc aattcatttc atctactaca gggtgaggtg 4140 
cttgccccca tgtgggtgtg gcagttacag ggcccaggtg agctgaagac aaaccactgt 4200 
acatatatat gccttatgta attattttct ttttgtaatt agtaataaaa cccagcatgt 4260 
acaaaagtac catagaacag aactgctaaa tactgtacat agatgtatca ttaatgtagg 4320 
tttagata 4328 

<210> 27 

<211> 569 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__f eature 

<223> Incyte ID No: LI : 450887 . 1 : 2000MAY01 
<400> 27 

cgtcggttca cttctccagg aaagggttcg tactcatggc gccgccgcag ccaaagtcgg 60 
gcctcttcgt tggcatcaac aagggtcatg tcgtcaccaa gcgcgagctg cctccccgcc 120 
cgtgccaccg caaggggaaa tcaacgaaga gggtgtctat ggtcaggggc ctgatcagag 180 
aggttgctgg gtttgctcct tatgagaagc gtatcactga gcttctgaag gttggcaagg 240 
acaagcgtgc cctgaagctt gctaagagaa agcttggaac tcacaagagg gcaaagaaga 3 00 
agagagagga gatggcgggc gtcctcagga agatgaggtc ggctggtacg cacactgaca 3 60 
aaaagaaata gagagcattt caagttcatg gagctggctg ccagagatta tgttccagtg 420 
tctgattttc catacatgta gaacctaata gacatgtcaa agtattatgt atcgaaccag 480 
ctcatgggat tttgctcctt ccaatgcatc cagggtttat gtatcgaacc aatttatggg 540 
atcttgctct tattctaatg catccatgg 569 

<210> 28 

<211> 3644 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 119992 . 3 : 2000MAY01 
<220> 

<221> unsure 
<222> 2628 

<223> a, t, c, g, or other 
<400> 28 

gacaatcttc aggacacact tgaagctgct agctttttta caaatattac ccgttttgga 60 
tttctgtaaa gtatttctta tatcaggagt ctctttggat aactgtgttg aggttggacg 120 
aattgctaac acctacaatc ttatagaagt ggataaatat gttaataatt tcatctctga 180 
agaactttcc tgctttattg agtactgggg agtttctaaa actccctttt gaacgacttg 240 
catttgtgct ttccagtaat agtcttaagc actgtaccga acttgaactc tttaaggcag 300 
cctgtcgctg gctaaggttg gaagaccctc ggatggatta tgctgcaaag ttaatgaaga 360 
atattcgatt tccactgatg acaccacagg atctcatcaa ttacgtgcag acagtagatt 420 
tcatgagaac agacaatacc tgcgtgaatt tgcttttgga agctagcaat taccaaatga 480 
tgccatatat gtcagccagt gatgcagtca gatagaactg gcaatcgaac tggattccac 540 
tcacttggtt acattaggag gagttttgag gcagcagctg gttgtcagta aagaattacg 600 
gatgtatgat gaaagggcac aagaatggag atctttagcc ccaatggatg ctccccgtta 660 
ccagcatggt tattggctgt tcattggaaa ctttctttat gtagttggtg gtcagagtaa 720 
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ttatgataca aaaggaaaaa ctgctgttga tacagttttc agatttgatc ctcggtataa 780 
taaatggatg caggttgcat cattaaatga aaagcgcaca ttctttcact tgagtgccct 840 
caaaggacat ttgtatgcag ttggtgggcg cagtgcagct ggtgaactgg gcacagtaga 900 
atgttacaac ccaagaatga atgagtggag ctatgttgca aaaatgagtg aaccccacta 960 
tggtcatgct ggaacagtat atggaggctt aatgtatatt tcaggaggaa ttacccatga 1020 
cactttccaa aatgagctca tgtgttttga cccagataca gataaatgga tgcaaaaggc 1080 
tccaatgact acagtcagag gtctgcattg catgtgtaca cgttggagat aagctctatg 1140 
tcattggtgg caatcacttc aagaggaaca agtgattatg atgatgttct aagctgtgaa 1200 
tactattcac caacccttga ccagtggacc ccaattgccg ccatgttaag aggccaaaga 1260 
tgatgttgga gttgccgtct tttggaaaat aaatttaatg ttgttgttga atattctggg 132 0 
aataatcgtt gtatggtaga aattgtccag aaatatgacc cagaaaaaga tgagtggcat 1380 
aaagtttttg atcttccaga gtcacttggt gggcattcga gcctgtacac tcacagtttt 1440 
tccacctgaa gaaaaccctg ggtcaccttc tagagaatca cctctttcag caccttcaga 1500 
tcattcttag gtctaaggtg taacaccttt gcagtacgtc gatgggtgat ctaatacttc 1560 
cccttcagtt gtatcttctt acagtgattg gtacagttat tagatataaa ggtaactgat 1620 
gttattcgtc ttgtatggct tttagtatgt gctatcaagt ggctaacaaa tgcattctga 1680 
aaatgtattt aacatagctg tgctaacaaa tgaaaaaaag acgtagaaaa atgtttagat 1740 
gtctttttgt gatgttatat aaaattgtag atgactgtgg taaatgtgta attatgtcca 1800 
ttatgcttca aagttgaagt tttcatcttt gactccaaaa tgtcagaggg aggccgctct 1860 
aaactaaaaa taacgaaagg ttgccaagta ttaatactag ttacctccct cttttcgtag 1920 
tttttgtcat gtctgtcaac ttactcgatt gtgtggttgc attcagaata tttgaagttt 1980 
cttacgtaga cagaaataat aaaaatatta actaggaaaa aacagtatag caccaagcca 2040 
gtatttggta tctctctcta gagcgagcaa gagagggaga gaggaggaaa aaatacacat 2100 
aatacaaaca tacatgcatg cacacataca tacatatgta tacacacaca taatttgaaa 2160 
actgattggc cacttcaacg atggctgaaa ttgtttttaa attgaagttt ctttcttcca 2220 
caaagcagcc cgtttctatt caaatggaaa ttcagtacca gagaataaat gtctatgtag 2280 
tcatactgaa tttagataga taagggctac aagcatacta aatcgagcaa ccaaatttgt 2340 
catgtgacta aacccgttac ttcagatgaa gcttacatta ctgttttctg cttgtgtatt 2400 
ttcccgtaga gtacttttac acagattggt aaaggttcag gtttcacgag aactgctttt 2460 
gtgcagaaaa tttaggttct tttttccacc ttttttgggt cagtaaaact taatgaaaaa 2520 
agcaaagaaa aaaaatattc tggaacaaag ctataagggt tttaaagttc agcctcccaa 2580 
cgttaagtca tcctaacatg attattttgt gatttggggg tgcttgcncc tggtgctgtt 2640 
ccagtccatg tggcatcctg agctgtgtga tctgcctcga ggctatgatc tgagcacgca 2700 
ggagataaca ttttcttctg catcaagtga ggaaaaatgt gcttttgggc catgtctcaa 2760 
agacaggacc aacttcagat ttcccaaaga agccagctac agagcctctg gaacactatg 2820 
gtcttacaag cagtacttaa aatcaaccct cgagcctctt caatgccgaa aggtatcccc 2880 
tatttggttg agaaccacat ggtaattttt aatgggactt tttatcagca aatggagtta 2940 
caggaattct ctgtaatgag tgattctgaa gaggtacttt cctgggaata attatctacc 3 000 
tgaagaaaaa aaattttata tatacattgt gtgtgtgtgt aatacacaca cacacaagcc 3 060 
ccctaatacc tggaagattg tcagcatgta aatcaggaac aactttctcc cttattgaca 3120 
atccccatta attaaaactc aggaaccaag gcaaaatgaa ttggcttcta gggggtctga 3180 
accttactgc cccatacaag tgttgattca ttttaatgct gtttatgatt tctgcattgg 3240 
cagaaaattt tcatactttc tatgtttttt ttaattactc agttttttat tacctaaaaa 33 00 
taggcacatt tgagtacatt tgaaaagtag aaaaattaga aattattaac tttattgaat 33 60 
aagcaagaag tgcatcctaa tccctttgat tattaatgag gttgaatatt tgtgtgctat 3420 
cggtagctgt gtttctttga tcaaatgttc ctgtcctttt gcccttctgt tatctgttgg 3480 
gagttgcttt gtttttcgta tcaagttata gggatctctt tatataataa atgtaattta 3 540 
acttgcattt gcttgcattt atttcttccc tcaatctgtt gtagttttac aaaggcaacg 3 600 
ctgttcagtt aatttttgag atcaaatttg tctttttttt tttt 3644 

<210> 29 

<211> 2805 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 197241 . 2 : 2000MAY01 
<220> 

<221> unsure 
<222> 325 

<223> a, t, c, g, or other 
<400> 29 

ccccgttccc gattcctgta gtagcggctg tattgcagcc gcctgccgaa ctgacccggg 60 
tctggggact ggcccctctg gcgccgttcg gtttctctta ttgccttcac tgaggatgag 120 
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WO 01/62922 



PCT/US01/05896 



tccctttgtg 
ggtcaccctt 
ccccctcctc 
ttttaaactg 
tttggattta 
aggcaggatt 
cctttgtgtc 
tataaaaaca 
tgaagcatgc 
cccttctctc 
gccctcca.cc 
aaaggagtgg 
cactttttta 
atattcaaga 
atgtagtatt 
ttaccctggg 
tccaggagtg 
tctcagaaag 
ggagatgaac 
tcccaagtgg 
acatttggtc 
gtacccagac 
ggcaggtttc 
ggacaactcg 
gtctctgccc 
acagtctgtg 
atggcttcta 
aaggttgatc 
gatggagctg 
ttgggcctat 
tatttgttaa 
gagttctgga 
atgtctctga 
atagagcctg 
gatcctaaaa 
gtttggcaaa 
agttggcaat 
ttcagaaatt 
gatgaagatt 
ttgaatacct 
aaacccaaga 
atttgaataa 
aaagtaatca 
caaggcaaaa 
ttcgacttca 



gctctatgtg 
ggcaatttat 
actccccctt 
aagaacttgt 
agctggccaa 
taacattatc 
ctggcttttt 
gttagggatg 
tttggaagtc 
cacgtcattg 
cctgtttcct 
atctgttggt 
tggacatatt 
taaagaaggc 
caagaatact 
tcatggcaag 
ggatttatat 
ggcaggttta 
agacatgctt 
cagctgctaa 
taaacatttt 
agatacaggc 
atacagtcct 
gttgtttgct 
ttcaccataa 
ttaccacaag 
aacagttgaa 
ctgaacattt 
gaagggtgtt 
ccacgtcagg 
cgcaaggatg 
aaagaaagag 
tataaatagt 
ttagtgtcag 
caagccttta 
ctgttgaggg 
agactcttcc 
gtttcttttc 
ctgccagggt 
tttacaattt 
atacacttaa 
agtgaatttc 
agctcaaaca 
atattaggga 
gtaatttgag 



gaccctgcgg 
ggatatttaa 
ggtgcttgac 
atttncgata 
aaaattgcta 
ctaaaaaggt 
cattttttaa 
agttcaccca 
ccttctgtgg 
ttacaaagct 
cc tcgtggag 
gaaagaccaa 
gattgtgttt 
ttgtcagctt 
gatcctacag 
ccagaatagc 
caagcaggtg 
tacctgtggt 
ggtccctcgg 
ggatcatact 
tcatcaatta 
aaaatatctg 
atggactaga 
agatcccaat 
agacattgct 
gggagatatt 
cttgaaaaaa 
gaaagaaaat 
ttgctggaga 
gtcttcattt 
gagaaggatt 
attttatcaa 
gtgtatgaaa 
cacagatcca 
tgaaaattcc 
aagcagatga 
ctgcacataa 
agatggtaat 
gccatctctt 
atatacacag 
acaaaaaccc 
catgaagatg 
gttagtgaga 
agatgatcct 
tagtaggtta 



aatccaccgg 
acagggtcag 
tccaggaata 
tgaactttat 
taacagatat 
aatttattct 
aattagtttt 
tgcctgactg 
taacaaaggg 
gccactatca 
aagaaaggag 
gagfcctggat 
ggtctctatt 
tggatcttgt 
atgtttatac 
aaacatcatc 
gtgctttgta 
catggtcctg 
cttgtggaag 
giztgtattaa 
ggaattattc 
aaaggaagga 
gaagctgttt 
ggagaaaagt 
ctgtctttgg 
tacttacttg 
gttcttgtgt 

gggggtcaaa 

tcagtcaaca 
ctgatatggc 
tagagggaga 
accttcacga 
gaattcgact 
agtggatgca 
agctgtgtcc 
aatggacagc 
atatattttg 
acttcagaat 
tgtggtagag 
atacttgtga 
agaagaacta 
ataaccagaa 
ggcagaagag 
gtaagaatgt 
gatggagtca 



cgcagtttca 
acagtgtgga 
atttataaac 
agaagctatt 
atacgtttta 
ctgtaacttc 
cacgattctg 
cacatcaaag 
gagcgaaaac 
aggatgtttt 
tgttagattg 
ggacagcctt 
gaagcatggt 
aatgaaggat 
ttggggcgat 
cagagttggt 
aatttcactc 
ggagggcgat 
gactgaatgg 
ctgaagatgg 
caccgccttc 
caatcattgg 
acactatggg 
gtgtaactgc 
ttgctgcaag 
.cagactatca 
ctgggggtca 
aaatttgcat 
gttctctgaa 
tttaaataga 
tggtttgaag 
ttcctcatca 
tgagaaactt 
actttgcaat 
tcatcatcct 
attcatgatg 
gcagtgcatt 
ttacagatat 
aaggttcatc 
ctttttaach 
tcagggaact 
gtctgcattt 
caaacctaaa 
tgcaaactgt 
gattt 



tctagcgact 

cgggggagtt 

tgtggaattt 

tataactttt 

tactattgtc 

ctcaatagca 

aagtaagtgg 

tgtcgatccc 

ccgattaagg 

tggcaggaat 

gcttattcag 

gcaccagaag 

gttagtctgt 

agaccaactc 

aatacaaatt 

ggatctgttc 

cgtgtttctg 

tagggacatg 

tcataattgt 

atgtgtttafc 

cagttgtaat 

cgttgcagca 

actacatggt 

tcctcgtcag 

tgatggagct, 

gtgcaagaag 

tatggaatac 

tcttgcaatg 

gcagtgtcga 

aatgaaattc 

agaaaagaaa 

gatgtgtctt 

acctttgcac 

cctgcagtca 

tttttgaaga- 

tgacatttca 

ctgatttttt 

ttaccagaaa 

cctgacatgt 

ccatggcttc 

ctgaattctc 

gaagtttaca 

tcttgtgaaa 

gtggaagaaa 



180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

I860. 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2805 



<210> 30 

<211> 572 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI:406860.20 :2000MAY01 



<400> 30 

gtttgtatgt 

tgttctcaag 

aggcctgctg 

ggattgtgca 

agctgcgctg 

aagtggaggc 

tgccccacgc 

tcaccttcaa 

caagcccgct 

tccatgccga 



gatgctggag 
agctgtgctg 
gccaagccta 
cccggcagcc 
tgaggtggcc 
atcagatgcc 
ccagcctgag 
tgtcatcctg 
ctgtgttggc 
gggccccgcc 



atgactcggc 
ccagtgggga 
tgtgcccctc 
cgctccctgg 
ccagctgggt 
ctgcagctgg 
gacgccgggg 
gctgagcctc 
cccggggagc 
gagtctctgc 



cttcttcact 
gggctgagga 
tccatgccca 
atctgcattt 
ctcaggtgcg 
gtgccgaggg 
agtatgtgtg 
cagtgcagtt 
cagtggtgca 
at 



gtcactgtca 
tctggcgggt 
cccaccagcc 
tggggctcca 
ctggtacaag 
gcccacccgc 
tgagacccgg 
ccttgctcta 
ggagggcgag 



cagctggcac 60 
gaggcaggag 120 
ccaccagaga 180 
gggcgcgtgg 240 
gacgggctgg 3 00 
accctgaccc 360 
catgaggcca 420 
gagacaactc 480 
ggcctagagc 540 
572 
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<210> 31 

<211> 1082 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 142384 . 1 :2000MAY01 



<400> 31 

ggcggacgtg ctgccgagta gtcccggaag 
ggaggcggca acggcgggcg ccggggcggc 
tgttggcgtt agcggcagcg gcggcgggtt 
ggcggcggcg gagtgtgggc ggggccgggg 
ggctcccgcc ctcagccgct gcccacgggg 
tcagctccga ctgggacggc gagcgcaccg 
atatcatgtc catttataag gagcctcctc 
acatgactaa gattcatgca ttgatcacag 
tcttcctgtt cgtgtttcgg tgtccgcccg 
tgatgacaac gggcaataac acagtgaggt 
tctgcttgag tattctaggt acatggactg 
cctcagtgct catctctatc cagtccctga 
gctttgaaca ggagagacat ccaggagaca 
agaccatcag agttgcagtc tgtgacatga 
tacgaggggt gatggagaag tcctttctgg 
aagatcgcct gcaccttcaa ggccaaacta 
actttgacta ccagtccctc ttgatgcgcc 
ggctccataa tgagaatgca gaaatggact 
ac 



cgaagcagcg atggcggaga gtccgactga 60 
gggccccggg gcgagcagcg ttgctggtgt 120 
cgggccgcct ttcctgccgg atgtgtgggc 180 
gcccggggag cggcctggct ccgctgcccg 240 
ccgcgctgct tagccactgg gaccccacgc 3 00 
cgccgcagtg tctactccgg atcaagcggg 3 60 
caggaatgtt cgttgtacct gatactgttg 420 
gcccatttga cactccttat gaagggggtt 480 
actatcccat ccacccacct cgggtcaaac 540 
ttaaccccaa cttctaccgc aatgggaaag 600 
gacctgcctg gagcccagcc cagagcatct 660 
tgactgagaa cccctatcac aatgagcccg 720 
gcaaaaacta taatgaatgt atccggcacg 780 
tggaaggaaa gtgtccctgt cctgaacccc 840 
agtattacga cttctattag ggtggctgca 900 
tgcaggaccc ttttggagag aagcggggcc 960 
tgggactgat acgtcagaaa gtgctggaga 1020 
ctgatagcag ttcatctggg acagagacag 1080 

1082 



<210> 32 

<211> 2497 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 895427 . 1 : 2000MAY01 



<220> 

<221> unsure 
<222> 1938 

<223> a, t, c, g, or other 



<400> 32 

tagcctgcac ctgtacggtc tcggggggct 
cctctgctgc ctactgcaag tggatgggga 
ggggccggac ttcctgctgg ctggaccaca 
tcctgcgcgc cctggtgctt gcgtgggacc 
agggcaccgt gctgctgccc acggtcttcc 
gcctggagcc tcaggggctg ctgtatgcca 
ctgccacagc tgagccccgc gtctttgggc 
cggccccccg gccaggtgcc cctacatcat 
agggctgcgg gtagtgggac tgtaccgtct 
tcgggatgcc tttgagcggg acagtgcagc 
tatcaatgtc atcactggca tcctcaagga 
cacccagccc ctgtataagg tggtactgga 
gttcccccca ccactgaggg cacccgaggg 
ggccacgctg acgcttctcc tggaccacct 
ccgcatgacc ccacagaact tggccgtgtg 
ggcgcccaca aggcctcgtg cccgcagctc 
caagcaccac atcgaggtgc tgcactacct 
ccgacaatct ccagatgtcg cgccttactt 
gccgctggca gaccccgaag tggtgactcg 
cccgagcaac cgctacgccg gcgactggag 
ggcgggattt cctgtccggg ccagactacg 
gaggacgagg aggtcggcga gccgagggtc 



gcggccagcg ccgggggcca cccccaggga 60 
ggccagggcc cgaacagggc cactgccacg 120 
ccttccacct ggagctggag gccgccaggc 180 
ctggcgtgag aaggcaccgg ccctgtgccc 240 
gagggtgcca ggcccaacag ctggccgtgc 300 
agctgaccct gtcggagcag caggaagccc 3 60 
ttgcccctgc cactgctggt ggagcgggag 420 
ccagaagtgc gttgggcaga tcgagcgccg 480 
ttgtggctca gcggcagtga agaaagagct 540 
ggtctgccta tctgaggacc tgtaccccga 600 
ttatcttcga gagttgccca ccccactcat 660 
ggccatggca ccgggcaccc cccaaacaga 720 
ctcctacagc tgcctgccag atgtggaaag 780 
gcgcctcgtc tcctccttcc atgcctacaa 840 
cttcgggcct gtgctgctgc cggcacgcca 900 
cggcccaggc cttgccagtg cagtggactt 960 
gctgcagtct tggccagatc cccgcctgcc 1020 
gcgacccaaa cgacagccac ctctgcacct 1080 
gccccgcggt cgaggaggcc ccgaaagccc 1140 
cgtttgcggg cggggacttc ctgacctgtg 1200 
accattgtga cgggcagtga cagcgaggac 12 60 
accggtgact tcgaagacga cttcgatgcg 1320 
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cccttcaacc cgcacctgaa 
gagagctctc caaagcaaaa 
ccccggttag taaggaccgg 
agccagacct gttgctcagg 
tcgcgtgtat ggcttgagac 
ttgccaagtc tggggcctgg 
agcaccaagg gcttcccctt 
gcctctcttg cctccccttg 
tgggtgagtc acctctagta 
gagcactgca ctgtttctcc 
tttgtggggg gcagcccnct 
gggcttatag acggaacaca 
caatggcctt ttagcagcca 
ccttgccccc cccttgcccc 
ttggaggggg ccccctgaag 
ttgcctccca tgcatgtgct 
gaaggcaagg catggtgagg 
atgccccacc caccctggaa 
tgggagagtg acggttccct 
ataataaata aatatgcaca 

<210> 33 
<211> 2876 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: LI : 757439 . 1 : 2000MAY01 

<220> 

<221> unsure 

<222> 1472, 1495, 2463 

<223> a, t, c, g, or other 

<400> 33 

cggaagccgc ggtagcggag aagactggag ctccgaggag ctgcatctgc ggcaacctgt 60 
gtgctgacgc tacgtgcctc ctggcttccg acgtagctcg cagctcccca gtctcactcc 120 
attccttccc cacctggcgc gcacctgctc aagaccaggg tcctgccaag cgctaggagg 180 
gcgcgtgcca ggggcgctag ggaactgcgg agcgcgcgcg ccatggggcc gccgcctggg 240 
gccggggtct cctgccgcgg tggctgcggc ttttccagat tgctggcatg gtgcttcctg 3 00 
ctggccctga gtccgcaggc acccggttcc cggggggctg aagcagtgtg gaccgcgtac 3 60 
ctcaacgtgt cctggcgggt tccgcacacg ggagtgaacc gtacggtgtg ggagctgagc 420 
gaggagggcg tgtacggccc ggactcgccg ctggagcctg tggctggggt cctggtaccg 480 
cccgacgggc ccggggcgct taacgcctgt aacccgcaca cgaatttcac ggtgcccacg 540 
gtttggggaa gcaccgtgca agtctcttgg ttgggcctca tccaacgcgg cgggggctgc 600 
accttcgcag acaagatcca tctggcttat gagagagggg cgtctggagc cgtcatcttt 660 
aacttccccg ggacccgcaa tgaggtcatc cccatgtctc acccgggtgc agtagacatt 720 
gttgcaatca tgattcggca atctgaaagg cacaaaaatt ctgcaatcta ttcaaagagg 780 
catacaagtg acaatggtca tagaagtagg gaaaaaacat ggcccttggg tgaatcacta 840 
ttcaattttt ttcgttttct gtgtcctttt ttattattac ggcgggcaac tgtgggctat 900 
tttatctttt attctgctcg aaggctacgg aatgcaagag ctcaaagcag gaagcagagg 960 
ccaattaaag gcagatgcta aaaaagctat tggaaggctt tcaactacgc acactgaaac 1020 
aaggagacaa gggaaattgg ccctgatggg agatagttgt gctgtgtgca ttgaattgta 1080 
taaaccaaat gatttggtac gcatcttaac gtgcaaccat attttccata agacatgtgt 1140 
tgacccatgg ctgttagaac acaggacttg ccccatgtgc aaatgtgaca tactcaaagc 1200 
tttgggaatt gaggtggatg ttgaagatgg atcagtgtcc tttacaagtt tccctgatat 1260 
ccaatagaaa tatctaatag tgcctcctcc catgaagagg ataatcgcag cgagaccgca 1320 
tcatctggat atgcttcagt acagggaaca gatgaaccgc cttctggagg aacacgtgca 1380 
gtcaacaaaa tgaaagtcta cagctggtaa aaccatgaag caaattctgg tggcagtgga 1440 
tgttattcct catgttgaca acccaaccct tntttttgga agaagactgg aaaanctcct 1500 
aatcaagaga ctgctgttcg agaaattaaa tcttaaaatc tgtgtaaata gaaaactgtg 1560 
aaccattaag taataacaga actgccaatc agggcctagt ttctattaat aaattggata 1620 
aatttaataa aataagagtg atactgaaag tgctcagatg actaatatta tgctatagtt 1680 
aaatggctta aaatatttaa cctgttaact tttttccaca aactcattat aatatttttc 1740 
ataggcaagt ttcctctcag tagtgataac aacattttta gacattcaaa actgtcttca 1800 
agaagtcacg tttttcatct tataacaatt ttcttataaa aacatgttgc ttcttaaaat 1860 



tctgcaaaga cttcgacgcc 
tcaacgtgtg ccttctgagc 
gcgcccagtg gctaaggcgg 
ccgagctcct gggttgccag 
ttcattccca gtttccaggg 
ggatttcacg ggaccagcgg 
ttgttgccac aaacggtcgt 
gccggggcaa caccagttac 
cggccctctt gctgctgcca 
ctcccttggg acggacacaa 
atccctgggt tccaagcatg 
gcttgtttcc ccctccactt 
atggagataa cagagttctg 
cccccccgaa aaaaatgtga 
cgtcaggcct gggggcagtt 
gtgtgtgtct gtgaggcacg 
gctctattca tgggaccaca 
tcccccatat atggtgcctc 
cctgtctccc tcttcttttc 
ggttctgaaa aaaaaaa 



ctcatcctgg gatctggaga 1380 
ccagatgacg gcggtgggga 1440 
tgccctggtg accaaggacg 1500 
cgagttacca cggggaccag 1560 
cccggctatt tggacactag 1620 
cttgtgaccc atctttcctg 1680 
cctcgcgctt gctagcgctg 1740 
tgtgagcatc accctgggtg 1800 
accaaatcag tattagcttt 1860 
agactaggca tgaggcactc 1920 
ggacacaggg ggtagcctgg 1980 
tccccgggga aaaccccacc 2040 
gccccttccc atccccatct 2100 
gcacgttaaa cccctccctt 2160 
tggtacggga acatatttac 222 0 
ggtgtgcgtg gacacagtct 2280 
gcaggaggga gcagtttgcc 2340 
agtgggcccc cgagttccag 2400 
cgcacctccg atctttgtgg 2460 

2497 
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gtggagtagc ctgtaatcac tttattttat gatagtatct taatgaaaaa tactacttct 1920 
ttagcttggg ctacatgtgt cagggttttt ctccaggtgc ttatattgat ctggaattgt 1980 
aatgtaaaaa gcaatgcaaa cttaggcgag tacttcttga aaatgtctat ttaagctgct 2040 
ttaagttaat agaaaagatt aaagcaaaat attcattttt tactttttct tatttttaaa 2100 
attaggctga atgtacttca tgtgatttgt caaccatagt ttatcagaga ttatgggact 2160 
gaattgattg gtatattagt gacatcaact tgacactaga ttagacataa aattccttac 2220 
aaaaatactg tgtaactatt tctcaaactt gtgggatttt tcaaaagctc agtatatgaa 2280 
tcattcatac tgtttgaaat tgcgtaatga ccagagtaag taacactgaa tattgggcca 2340 
ttgatcctcc gttccatgaa ttagtctacc agaaaaaaaa tggttctgta aaaattagtc 2400 
ctgttggaaa atggtttttc caaacaatgt ttactttgaa aattgagttt atgtttgacc 2460 
ctnaatgggc gtaaaattac attagaataa acgtaaaatt gctgtgccgt gtaactgata 2520 
aattattgtg aaatgcatta ttcactggtg tattgaaaaa agaagaggga gggagaatta 2580 
ccaggtgcca ttaataataa agatttgaag ctatcattcc accaatagtt aaatttagag 2640 
actaatttaa aatatgcaca tttaatttgt acatctgtga tggcttattg tatatagaat 2700 
afcttgtatac aaatatatag cagaatttag gcaaaaaata aaacagacat gtatttttgt 2760 
gtgctgaatg gatgaaacca attgcattct tgtacactga tatacaaatg ctgtaaatat 2820 
gtccccattt ttattgattc tctttaaata taaaatgtaa ataaaatatt ccaata 2876 

<210> 34 

<211> 1288 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 1144066 . 1 : 2000MAY01 
<220> 

<221> unsure 
<222> 1243 

<223> a, t, c, g, or other 
<400> 34 

ggggtgcgac gccgagggcg ggggagcgcg cgccgctgct cccggaccgg gccgcgcacg 60 
ccgcctcagg aaccatcact gttgctggga ggcgacctgt acaaatccta agcgaatttt 120 
ttggagcatt ttcaccccgg aaactcgcca tccagaagtg tgcttcccgc acagctgcag 180 
ccatggggtc tgaggaccac ggcgcccaga aacccagctg taaaatcatg acgtttcgcc 240 
caaccatggg agaatttaaa gacttcaaca aatacgtggg ctacatagag tcgcagggag 3 00 
cccaccgggc gggcctgggc aagatcatcc ccccgaagga gtggaagccg cggcagacgt 3 60 
atgatgacat cgacgacgtg gtgatcccgg ggcccatcca gcaggtggtg acgggccagt 420 
cgggcctctt cacgcagtac aatatccaga agaagggcat gacagtgggc gagtaccgcc 480 
gcctgggcaa cagcgagaag tactgtaccc cgcgggacca ggactttgac gaccttgaac 540 
gcaaatactg ggaaggaacg ctcaccttgt gtctccccga tctacggggc tgacatcagc 600 
ggctcttggt atgatgacga cgtggcccag tggaacatcg ggagcctccg gaccatcctg 660 
gacatggtgg agcgcgagtg cggcaccatc atcgagggcg tgaacacgcc ctacctgtac 720 
ttcggcatgt ggaagaccac cttcgcctgg cacaccgagg acatggtacc tgtacagcat 780 
caactacctg cactttgggg agcctaagtc ctggtgagtg tctacactgg ccctgccgcc 840 
ggccggaccg agagcccctc gggagggagt caatcccggg tacacggctg ggcgccgtgg 900 
caggggcccc accaggtgag gccgcaaagg tcggcctatg acggctggag atcttccgga 960 
ccgcctgggg tcacccacca gctttggggt gggggatgtg cacccccaga gccgaagctc 102 0 
ccaggcccct agagcttgcg ctttgtaccc cggagtgccc cccattgagc tgtgagcggc 1080 
cccaggtgtc cccatggcca ggagcgtggt cttgagcctc ctgagctgcc caggctgtgc 1140 
tgcctcacag ccaagtggag acgttcctgg tgaagggaca ctgtccatgc tgcccagagg 1200 
ggcctggcca ggatgaccct gcagccgctc cctcgcagtc tcngccctgg cacgtctggg 1260 
ccaggcccta cagttaggag ggcagggc 1288 

<210> 35 

<211> 5271 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 243660 . 4 : 2000MAY01 
<220> 

<221> unsure 
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<222> 3667 

<223> a, t, c, g, or other 
<400> 35 

tgaccctgag cggccccctg gagccacatg 
accctttggg ggtggtggaa tctggtaagg 
ccggcccccc aggagagccc cggctggaca 
ttcttttcaa taaggtctcc tgagatccag 
cagtccctca gtgccctacc caaggaacgg 
cccaacaagc gggagaagga ccacatccgg 
aagcggcacc gggccctggt ccaccagcgt 
aagatcctgg gcgagtggtg gtatgccctg 
ctggccttcc aggtgaagga ggcccacttc 
caaggaccga aagaagtcca gctcagaggc 
ggcacaagga gacgcgggag cggagcatgt 
tgtgtcctcg tgagctcctg tccgttgcag 
ctccggggag cagctcctgt ggggcagaac 
cccggccccg agctttctcc cacagcgggg 
gtcaggcgct acaggaactg acgcagatgg 
agccttctac ccagtatgga gctccaggac 
tggcggccac tgggcggccc ccgctgctgc 
ccagtgagga catgacgagt gatgaggagc 
atgatgtcat tgctgacgat ggcttcggcc 
gggtgaccga cagcgagagt ggggacagct 
ttggtcggaa ggtgttttca cctgtgatcc 
tggaccctga gcccccaggg cccccggatc 
ccgccccatc ctcctctgcg tcctcgcctg 
tctcactggg ctcaggaacc ttcaaggccc 
ccctacggcc cccaccccct ggggctgggg 
tcctcccaat ggatcctgcc accttccggc 
agccaccagg cccctcagtc atcgcggccc 
cactggtgct gcccccaaac aaggaggagc 
cccccgcccc atcactggcc tatggggccc 
ccatggtcac caatgtggtg cggcctgtca 
ccttccccac ctctggccgg gctgaggcgt 
aaatgggcac tgggtctcgg gtgcctgggg 
cggacaagaa gtcggcagca gccacctcac 
tgggcactgt ggggaaggcg cctgccactg 
atggggcccc tgcgccccct gctgtccagt 
ccactgcggg ctcaggagca ggtgctggga 
tcctgcaacc aggtgccctg ggcaaggctg 
ccacgctgcc ccagcagctt caggtggcac 
cagcggctcc catgcggccc tgcacccacc 
acttccacca acggcaaagt cctggctgcc 
cagtctgtac cctccgcccc accccccaaa 
cccccgggtg gctcagccca gctgctgcct 
agcatgtcat ttgcggggtg gaaggggcgg 
cccttctcag tacctgtgca aaatggtgcc 
ccggtgccct gtgagcacac ccagcggcct 
tccctggacc cacctgctca atctcagaag 
cctatgtgca gtcagcgggc gggcacgcgc 
aggctggaac agtcacctcg tacgggccca 
tggggcccag cggccccgcc ttcgtgcagc 
tggctcccgg tcaggtgggc gtgtcacctg 
cagcccccgg aggtcctgtc afcaacagcat 
cagcacccct tggcccagcc atcccaggcc 
agcacaaccc cacctgcagc caccattctg 
accccagccc cgactagccc tttccccagc 
gtggccccca aggcccagcg gcccagcccg 
gccagcattc ccgtggggtc ctttgaggca 
cggcaggcgt ctggagcctg gcccagtccg 
gtgggcagcc cacaccacca gcccctccac 
ggagcagccc ccacacgtgg ccccgcacac 
ggccctgaga cccatggcca gcaaattccc 
gcagggcctg gagaatcgtg gggagcctcc 
tgtagcnccc tggtggcagc agcgagagca 
gagcgcaatg gaggctgtgc tggtactggc 



ccctgagagc ccaggacccg gaccccccac 60 
gtccgcctcc caccacggag gaggaggcct 120 
gtgagacaga gagtgaccat gatgatggct 180 
ttgcctctac cgcccggaaa acgtcggacc 240 
gactcatctt ctgagaagga tggacgcagc 300 
cggcccatga atgccttcat gatcttcagc 360 
catcccaacc aggacaaccg gaccgtcagc 420 
gggcccaagg agaagcagaa gtaccacgac 480 
aaggcccacc cagcattgga agtggtgcaa 540 
caagcccacg gagcctgggg ctggcaggag 600 
cggagacggg cactgctgcg tgcccctggg 660 
cccagacact cctgagctca gacaccaagg 720 
ggctacacac agttggggga cctggctcag 780 
tacacagcct ggacggcgga gaagtagaca 840 
tgtcfcggccc tgcatcgtac tctggcccaa 900 
cctttgcagc ccctggtgag ggaggtgcct 960 
ccacccgagc ttctcgttct cagcgtgcgg 1020 
gcatggtcat ctgtgaggag gaaggggatg 1080 
ccactgacct tgatctcaag tgcaaggagc 1140 
ctggggagga cccagagggc aacaagggct 1200 
gttcctcctt tacccactgc cgccccccac 1260 
ctcctgtagc ctttggcaaa ggctatggtt 1320 
cttcctcctc agcctcggca gccacctcct 1380 
aggagtctgg tcagggcagc acagcgggcc 1440 
gtccagcgac accttccaag gcaacccggt 1500 
gcaagagacc cgaaagtgtg ggtggcctgg 1560 
ctcccagcgg aggaggaaac atcctgcaga 1620 
aagagggcgg cggagccaga gtgccctccg 1680' 
cagcagctcc cctgtcccgt cctgccgcca 1740 
gcagcactcc tgtgcccatc gcctctaagc 1800 
ctccaaatga cacagcaggt gccaggactg 1860 
gctccccgct gggtgtcagc ttagtgtatt 192 0 
cagccccaca cttggtggct ggacccctgc 1980 
tcactaacct actggtgggc accccggggt 2040 
tcattgccca gggggcccct ggtggtggga 2100 
gtggccccaa tgggccagta cccctgggca 2160 
ggggaatcac ccaggtacag tacatcctgc 2220 
ctgccccagc accagcccct gggaccaagg 2280 
accagcatcc gtttcaccct cccaccgggc 2340 
actgcaccca ctcctggcat ccccatcctg 2400 
gcccagtcag tttctcccgt gcaggccccg 2460 
gggaaggtcc tagtgcctct ggccgcccct 2520 
gacagccaca tgccgacatg gtgagcccag 2580 
cagtccccca gaaagatcat ccagctgacc 2640 
ggtgcctgcc cctgaggacc ccagacacac 2700 
gtccttgttg acctcactcc accagaatca 2760 
tgcccctggg taccagccct gcgtccagcc 2820 
cgagctctgt agctctaggc ttcacctcgc 2880 
ccc tgctctc agcaggccaa gccccactgc 2940 
tgcccagtcc ccagctgccg cctgcctgtg 3000 
tttactctgg cagccctgca cccacctcct 3060 
cccccaagcc tggtctacac tgtggccacc 3120 
cccaagggcc cgccagcccc tgccactgcc 3180 
gccacagcag gttccatgac ctacagctta 3240 
aaggcccccc agaaagtgaa ggcagccatc 3300 
ggtgcctctg ggcggcctgg ccctgcaccc 33 60 
agagccaact gccccagagt ctgagcttga 3420 
ccctgccaga gacctggact cccacggccc 3480 
tgcgtgaagg agcaggacca tgcggccaag 3540 
cagctcatct tcagactggc gcgtccctgg 3 600 
cactcctccc agcccggccc cagctccagc 3660 
gcagtgggcg ggcagccggg gacaccccgt 3720 
aagaaggtga aggtgcggcc cccgacccct 3780 
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gaagaagacc tttgactctg 
ctttgctgag ttgcctgagt 
tctgggccac ctcacacccc 
aactccacgg acctggattc 
agacgctcca gctgcagctc 
atcttcacct ttgaccgtac 
gacaaggtgc catacgtcct 
gtcatgcagc tctttcagga 
caggcccgct atgcagacat 
ggtgcgccag aagatcatgc 
gctcctctcc ctgtaccgcc 
ccccgcaggg ggccctgacc 
ccccgcacac tgcctcacac 
ctccccagcc cttccccccc 
ggacccctcg agaagatgcc 
gggcaggaag gttatctcct 
aatgaggcct gctcctcttg 
ggggcagctg ccggggagag 
acctggcagc gtgtgacctt 
ctgcaaggct ggagggtggt 
ggggagtttc atgcacccct 
gggtgggggc tccctgcact 
ttctgccctc ccaacttggg 
gccccttccc caaggaacaa 
agaagccgca atgagcgaga 

<210> 36 
<211> 6070 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 
<223> Incyte ID No: LI : 334386 . 1 : 2000MAY01 

<400> 36 

ggctagacta ggacatacca aggtggttaa ttgtttgatt gggtgtggag caaatattaa 60 
tcatactgat caagatggtt ggacagcatt aagatctgct gcttggggtg gccatactga 120 
ggtagtttct gcactacttt atgctggcgt aaaagtggat tgtgcagatg ctgatagccg 180 
aacagctttg agagcagcag catggggagg acacgaggat attgtactga atttgctaca 240 
acatggcgct gaagtgaaca aagctgataa tgaaggtaga actgctttga tagcagcagc 300 
atacatggga catagagaga ttgtggaaca cctactggac catggagcag aagtaaatca 360 
tgaggatgtt gatggcagga ctgcactctc tgtagctgca ctttgtgtgc ctgcaagtaa 420 
agggcacgca tcagttgtta gccttttaat tgatcgaggt gctgaagtag atcattgtga 480 
taaagatggc atgactccac tgctggtagc tggctatgaa ggacatgttg actgtggttg 540 
acttgcttct agaaggggga gcagatgtag atcacacaga taacaatggc cgtacacccc 600 
tcttagcagc agcgtctatg ggtcatgcat cagttgtaaa tacacttttg ttttggggtg 660 
cagctgtgga tagtattgat agtgaaggta ggacagtcct cagtatagct tcagcacaag 720 
gaaatgttga ggtggtacgt actctactgg atagagggtt agatgaaaat cacagagatg 780 
atgctggatg gacacctttg cacatggcag cttttgaagg gcacagattg atatgtgaag 840 
cacttattga acaaggtgct agaacaaatg agattgacaa tgatggacga atccctttca 900 
tattagcttc acaagagggt cattatgatt gtgttcaaat attactggaa aacaaatcca 960 
acattgatca aagaggttat gatggaagaa atgcactgcg ggttgctgca ttagaagggc 1020 
acagggacafc tgttgaattg ctttttagcc atggtgctga tgttaactgc aaagatgctg 1080 
atggtcggcc tacactttat atcttggcct tagaaaatca gcttacaatg gccgaatatt 1140 
ttttagaaaa tggtgcaaac gtagaagcaa gtgatgctga aggaaggaca gcacttcatg 1200 
tgtcttgttg gcaaggccat atgggaaatg gtgcaggtcc tgatagcata ccatgcagac 1260 
gtcaatgctg cagacaatga aaagcgctct gctttgcagt ctgcagcctg gcagggccat 1320 
gtaaaagtgg ttcagcttct gattgagcat ggtgctgtag ttgaccatac atgtaaccaa 13 80 
ggtgcaactg cactctgtat tgcagcccag gaagggcaca ttggttgtgg ttcaggtctt 1440 
attagagcat ggtgctgatc caaaccatgc tgatcaattt ggacgcactg ctatgcgtgt 1500 
tgcagccaaa aatggacatt ctcagataat taaattatta gaaaaatatg gtgcatctag 1560 
tttgaatggc tgttccccat ctcctgttca cacaatggag caaaaacctc tacagtcatt 1620 
gtcttcaaaa gtgcagtcat taacaattaa atcaaatagc tctggtagta ctggtggagg 1680 
ggatatgcag ccttcgttac gtggtttacc ctaatgggcc tactcatgct tttagttctc 1740 
cttcagaatc tccagattct acagttgacc ggcagaagtc atcactgtca aataattccc 1800 
tgaaaagctc aaaaaattca tctttgagaa ctacttcatc tacagcaacg gctcaaacag 1860 



tggacaacag ggtcctgtca 
ttcggcctga ggaggttgct 
gggcccatcc atgggcatct 
agcacccgag gaccccacct 
ggagccctac acccccaaga 
aggtacagaa gccgaggacg 
cacctgcggc agcatccctg 
ccatggcttc ttcccgtcag 
ctttccctcc aaggttgtgt 
aggctgccaa ctcccaccgg 
ccccactggc accgctgcat 
ccacctcacc cagctcggac 
ccccagagtc ggggcctgga 
acccccaggt ccctccaaag 
aggacttata gtaccccctc 
cccgggtaaa gccatttgcg 
taaatacccc cttccctctg 
ctgcaggggc aagtctccct 
cagagctttt cactttatgc 
gcaggccttg ggcccacagg 
ttttttcccc agagggggct 
ttgccacaag gccacgggga 
ttgtactttc taaagaaggt 
aacattgttg atcatggtgc 
ttaaagcctg tttacacaaa 



gaagtggact tcgaagagcg 3840 
gcctccccca acctgcagtc 3900 
ttaccgcaaa gaagaggaag 3 960 
cgcccaagcg caagatgaga 4020 
gtgccaagtg cgagggggac 4080 
tgcttgggga gctagagtat 4140 
gaccacgcgc cagggccctg 4200 
cccaggccac agccgccttc 4260 
ctgcagttga agattcgtga 4320 
agcagccccc tggagctgag 4380 
gcacctgccc ccactcccag 4440 
tctggcacgg cccagtgctg 450 0 
cagcctggct gggagtgggg 4560 
gttgccacag gcagggtgag 4620 
aggacatgga cagtatgtgg 4680 
tcctctccag tttggggcgg 4740 
aagctccctc ccggtgctgg 4800 
cctccaagcc cctgtacata 4860 
aaaaatggct cctgtgaggg 4920 
gagtgcgcct gtggaatagg 4980 
ggactcaggg ttagtttgag 5040 
gggttttctc cttcaccccc 5100 
gattcccccg tggcccttgg 5160 
aatatttctt actgataccg 522 0 
aaaaaaaaag g 5271 



23/69 



WO 01/62922 



PCT/US01/05896 



tgccaattga tagctttcat aacttgtcat 
cacgcagtag taagtcgaca gtcaattgtt 
cagagtcata attcaccaag tagtgaattt 
tcaactaaag caagtaaagg ggggaaatca 
aaaaaagcga aacaaagtaa ttcttcacag 
tttgatagga agaggaccta ttaggccaaa 
atgccagcag aatctcaatg caaaattatg 
tctcaacagc agtttcttat tcaccaacaa 
aatgacaaat ccaaattatc atcttcagag 
cccacgaaca atgcaagata gagggcatca 
gacagaatta agcctttaaa caagctctga 
tcaactataa aaaggaaaca ccattataaa 
tgtgatggag tggttcttca gctactggat 
aaacataaaa aatgaagaat gtgatcttct 
tgcctaaata gtaaggctgc cttctcaatg 
gtgtgctttg tattcactac acaggaataa 
agttccctga taaaagctga aaagaaaatt 
aaatgtgcat gtgccataat caaatatata 
tttttctttt tgtggcaaaa gaaacttaaa 
gtatggcctg tttcttgtat cttgacaaga 
tgttctatgt tcactcacac cttcagcatt 
cattgaaatg aaaaaatgat tgctcgcgca 
gggacgctgc tttaagtagt tccgatctgg 
gctaaacatg tcagaacaaa caaccaacca 
ccgcctggga tacaatttca tctttccatt 
agacttatca ttcctgaggc atgaaaattc 
tgtgtgcaga gagaaatgca cctgtctatc 
tcattgacta aactgacctc ttcctcctgg 
ctcagcggtt tctattttgt aaattgctgc 
tggattaatt ttaatggttt ggttggatga 
cttcacagat gacaccactc ccctgtcaat 
taatagtgat ggatttgcac ttttctatcc 
aattgcatgc aggagggctg gatgccaggg 
aaattggttc aaatgagcat gtgtcccaca 
agctgtagct tagatgggtc gtttatatgt 
caaggttgtg gatttgatct tagaatgggc 
ttcttttcat gtactgttga tgaggaatga 
accattgcta tttctagaac aaaatcttta 
gtcatcttca tatatgaagg acacatgtgg 
attatgataa atgtttcact tgaatcttat 
aagtcagcaa aactcatttt gtctgtcatc 
ttgataagtt caattgatat taaggtgaca 
ttcaagtatt gaattcttgt ccctatggga 
acaaagtagc cttaaaagtg ttaatgagtc 
agaaccaact ttactcatat ctcaaataca 
gctctaattt" ctattacatg taattttctt 
ttctattaac tttgaattat tgaaattgta 
gcaccttgtg tccaaagggc gaagatatga 
agctaactgt tttcatatcc atactatata 
attggaaaaa tacgattttc aaaagtaagt 
ttttggaata gaacataaat gactttgagt 
gatgccctgc cttgcttatt tgtttctgtg 
tctgctccat gtgggaaagt aaaaattgtg 
tttttgaaag ttaaaatctt taaattttat 
atggcatgtg cttataacat ttttataaca 
gagtttataa taacggaaag tgataaaaat 
ctaaggaaag gttttggaag gaaaaatgac 
ccgaataatt tttgttcttt gttttcaccc 
gtttgatacc ttaaaaaaca gaagtaaata 
aatatttgtt ttgcttaatg tcctccattt 
aaatgctttc tttttatttt tcccttgggt 
attggaattt cacattgtaa aagttttatt 
tccctgtgta tatgaaagtg ccataataaa 
taattttgtc cttgaaccag tagaacatgc 
gttgttttaa gttattatct taaataaatc 
aagaagaaag tgaaatataa cactgtccaa 
tatgtcttcc taagcctttt taaagactta 



ttacagaaca aattcagcag cattcattgc 1920 
tccccatctt ccacaacaca gtccttagga 1980 
gagtggagtc aagtaaagcc cagtttgaag 2040 
gaaaattctg ccaagtctgg atcagctggg 2100 
ccaaaggttt tagaatatga aatgactcag 2160 
tccgggactg gctggcaccg ccttaaacaa 2220 
ataccttcag ctcagcagga aattggtcga 2280 
agtggtggaa cagaagaaga gaaatggaat 2340 
caaccaggtt tttcttggta gggtttcagt 2400 
ggaagtgttg gagggatacc cttcctcaga 2460 
agcttcagat tgaaggttct gaccctagct 2520 
agtttcctat tctgtgaaac agaaggacat 2580 
gggaaacata tgcctgttga tttgctgaaa 2640 
ggcagtacag ttaccttaat tactgtaatg 2700 
taaccctctg tgcttaaaaa atttcatttt 2760 
gcacttttta aaaatgcaga tacatactgc 2820 
tgagtatttt aagttaaaat tgtgataaaa 2880 
tgaaaaggca gtgttccttg tatttatttt 2940 
catactgttt cagtcacatt gcattgtagt 3000 
cgtagctcaa taaacacata tcttgcaacg 3060 
ggataaaaat catttcctat ataaatatca 3120 
gtttacaaca actcatttta tagtacttta 3180 
actctcccag tagaattctt ctcatctctg 3240 
gtctgttggc agaacaaagt cctatttcat 33 00 
cacctttgtc attccacctc ctaagaagac 33 60 
tcagggacaa agccatgcct cagtcacatg 3420 
taagggtaga tttttgatcc ctggaataat 3480 
gctaaataaa ttaattttgc tggcttctct 3540 
atgaccaaaa tagccccact caaaatcaat 3600 
atattctgga tgaatataaa atgtgctgcc 3660 
catagcacat gtgtactttt tattgttact 3720 
tcatactctt tcctgttttc ttctttgtac 3780 
ttaagagaga tattcatgac aaggaaggta 3840 
gccttagtct ccttactctt aaatcagtgg 3900 
ctggagaagt tgtcataaca gtttagaagc 3960 
ctgttaatct tatagaaccc agaaattctg 4020 
ggaaagagaa tttggagatt cagcacacag 4080 
gaatatgtcc taataacaaa ggtcagtaat 4140 
atacattgtt cgtggaaata gaaatgtata 4200 
tggtaaggct tttcttgctt ttatttttta 4260 
tataagtcat agtgaggact acgagataag 4320 
tggcaatatt aataactcaa atgtgaatgt 4380 
catttattaa ataaaattaa tgggactctt 4440 
tattaataaa tatgaacaca tactattatt 4500 
gtacatttac attactggta aagggatgaa 4560 
tagaaagaga accctgaaag ctgccagttt 4620 
tttatttaat tttattgttt ttacaaaatt 4680 
cattgcataa gggatttatg tttttcaaag 4740 
cacttgaagc aattggtagg aagagaacct 4800 
attcctcggg atgtttttat ataaattatg 4860 
taggataagg atgataggat gggtgctgga 4920 
ggggatttaa tactataaat gaaaatggct 4980 
cctccaaata taaaactcct tcaatagata 5040 
aaatcagctg ttgccacgta cacaactatg 5100 
ttccgtctgt gtctattccc taaatatcat 5160 
acagtaatga aataaaaatc tggaatgttt 5220 
ctgaaatcag tgcgtactgc aaaggtacag 5280 
ccaccctcac tggaattctc accaaaaata 5340 
cttttcctca atattgtttt gggctatgca 5400 
acacttgctc agcaaatggt agttgcaaac 5460 
ttgagggtat gtaaatagcc aaaaatgtac 5520 
ttatcccttg gtatgatatt actcaaaaaa 5580 
tatatttgct ttacagagaa gatcttgttt 5640 
atgatatgca tatagcataa acaactgtta 5700 
ctgagcaaat gaatttggaa acattttgca 5760 
aggaaggtag aaaaacaaag atttactgtt 5820 
atgttctttt cccccccccg tgactgatta 5880 
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tatactatat cacaccatgt caggttttgt gcctctgaga attgcagtaa tccaataact 5940 
tttgtatatg tgtgctcctt gatcatcaga atattatggc catcttatgg cggatatttt 6000 
gggagtttat tgcaaacatg gtcattcatt ttctaaataa aatttgtgtg tttcttcact 6060 
cagtaaaaaa 6070 

<210> 37 

<211> 3474 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 347572 . 1 : 2000MAY01 
<400> 37 

gtcattcagt ggatgtgatc tgtggctcac aggggacgat gtcaagctcc ttcctggctc 60 
cttctcagcc ttgttgcctg taactggctg ctcagtccac cattgaggaa caggccaaga 120 
catttttgga caagtttaac cacgaagccg aagacctgtt ctatcaaagt tcacgttgct 180 
tccttggaat tataacacca atattactga agagaatgtc caacaacatg caataagttg 240 
ctggcgagac aaatgtgtct agcccttttfc acaaggaaca gtccacactt gcccaagatg 300 
tatccactac aagcaaactt cacgacatct ccacatgtca acgcttcagc tgtgcacggc 360 
ttcttcaagc cataaaactg tgagtcttca ggttggtcat cacgaagcac agagagcaaa 420 
ccggttgaac acaatttcta atatacaaat ggagccacca atcctaacag taactggaaa 480 
acgtcgtaac ccagataatc cacaagaatg cttattactt gaaccaggtt tgaatgaaat 540 
aatggcaaac agtttagact acaatgagag gctctgggct tgggaaagct ggagatctga 600 
ggtcggcaag cagctgaggc cattatatga agagtatgtg gtcttgaaaa atgagatggc 660 
aagagcaaat cattatgagg acttattggg gattattgga gaggagacta tgaagtaaat 720 
ggggtaaata gtggatatga ttacagccgc ggccagttga ttgaagatgt ggaacatacc 780 
tgttgaagag attaaaccat tgataggaac atcttcagcc ctatgtgagg gccaagttga 840 
tgaatgccta tccttcctat atcagtccaa ttggatgcct ccctgctcat ttgcttggtg 900 
atatgtgcgg gtagattttg gacaaatctg tactctttga cagttccctt tggacagaaa 960 
ccaaacatag atgttactga tgcaatggtg gaccaggcct gggatgcaca gagaatattc 1020 
aaggagtccg cagaacttct ttgtatctgt tggtcttcct tatatgactc taggattctg 1080 
cggcaaattc catgctatac ggacccagga aatgttcaga aagcactctg ccatccccac 1140 
agcttgggac ctggggaagg gcgacttcag agatccttat gtgcacaaag ggtaacaatg 12 00 
gacgacttcc tgacagctca tcatgagatg gggcatatcc agtatgatat ggcatatgcc 1260 
ggccaacctt tttctgctaa ggaaatggag cttaatgaag gattccatga agctgttggg 1320 
gaaatcatgt cactttctgc agccacacct aagcatttaa aatccattgg tcttctgtca 1380 
cccgagtttt caacgaacga caatgaaaca gaaataaact tcctgctcaa acaagcactc 1440 
acgattgttg ggactctgcc atttacttac atgttagaga agtggaggtg gatggtcttt 1500 
aaacggggaa attcccaaag accagtgggt gaaaaaggtg gtgggagatg aagcgaaaga 1560 
atagttgggg tgtgtggaac ctgtgcccca tgatgaaaca tatctgtgac cccgcatctc 1620 
tgttccatgt ttctaatgat tactcattca .ttcgatatta cacaaggacc ctgttaccaa 1680 
ttccagtttc aaagaagcac ttttgtcaag cagctaaaca tgaaggccct ctgcacaaat 1740 
tgtgacattc tcaaattcta cagaacgtcg tggacagaac actgttcaat atgctgaggc 1800 
ttggaaaact cagaaccctg gaccctagca ttggaaaatg ttgtaaggac caaagaacat 18 60 
gaatgtaagg ccacctgctc aactacttfcg agcccttatt tacctggctg aaagaccaga 1920 
acaagaattc ttttgtggga tggagtaccg actggagtcc atatgcagac cacagcatca 1980 
caagtgagga taagcctaaa atcagctctt ggcagataaa gcatatgaat ggaacgacca 2040 
atgaaatgta cctgttccga tcatctggtt ggatattgtt aattgaggca gtacttttta 2100 
acaagtaaaa aatcagatga ttctttttgg ggaggaggat gtgcgagtgg ctaatttgaa 2160 
accaagaatc tcctttaatt tctttgtcac tgcacctaaa aatgtgtctg gatatcattc 2220 
ctagaaactg aagttgaaaa ggccatcagg atgtcccgga gccgtactcc atgatgcttt 2280 
ccgtctgaat gacgacagcc tagagtttct ggggatacac ccaacacttg gacctcctaa 2340 
ccagccccct gtttccatat ggctgattgt ttttggagtt gtgatgggag tgataattgt 2400 
tggccatggt catcctggat cttcactgga atcagagatc ggaagaagaa aaataaagca 2460 
agaagtggag aataatcctt tatgcctcca tcgatattag ctaaggagta taaataatcc 2520 
aggattccga aacactgatg atgttcagac ctccttttag aaaaatctat gtttttcctc 2580 
ttgaggtgat tttgttgtat gtaaatgtta atttcatggt atagaaaata taagatgata 2640 
aagatatcat taaatgtcaa aactatgact ctgttcagaa aaaatattgt ccaaagacaa 27 00 
caagtgccaa ggagagagca tcttcattga cattgctttc aagtatttat ttctgtctct 2760 
ggatttgact tctgttctgt ttcttaataa ggattttgta ttagagtata ttagggaaag 2820 
tgtgtatttg gtctcacagg ctgttcaggg ataatctaca atgtaaatgt ctgtctgaat 2880 
ttcttgaagt tgaaaatcaa ggatatatca ttggagcata gtgttggatc ttgtatggaa 2940 
tatggatgga tcacttgfcaa ggatcagtgc ctgggaactg gtgtagcttg caaggattga 3000 
gaatggcagt gcattagctc acttgtcact ggcatccatt ggtcaaggac tgacatgctt 3 060 
tccttcacag tgaactcagt tcaagtacta tggtgatttg cctacagtga tgttgtggaa 3120 
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tctgatctat gctttccttc aaggttgaca ggtcctaaag agagacagaa tccagggtac 3180 
aggtagagga catttgcttt ttcacttcca aggtgtcttg tatcaacatc ttcctgtaca 3240 
aacactgaaa tctagagctc aggggctctc gcgtgaatct cccagagaca tgcctgtata 33 00 
gaaatctcta tttctagctg ttctctaact gtcggagtga tatggaatat tccaactgta 33 60 
tgttcaccct ctgaagtggg tacccagtct cttaaatctt ttgtatttgc tcacagtgtt 3420 
tgagcagtgc tgagcccaaa gcagacactc aataaatgct agatttaccc cctc 3474 

<210> 38 
<211> 3474 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 817314 . 1 : 2000MAY01 
<400> 38 

gctttcagag catcctcact ccgcccagtt cggtgccagc tgcgtgggct ccagcttcga 60 
tcgttttcct tggaatgctc caaaactcag cagcgactaa gggaattcca ttggaatttg 120 
ccgggcgtgc tctcaccccg cacggcaccc gcgccgtcag tcctcggatc ccatcacttc 180 
agcccgaaga ttgcaacttt gcagagacga agaaatagca tggcatgaaa tatggctcag 240 
ttctattaca aaagaaatgt taatgctccc tatagagacc gcatccctct aaggatagta 300 
agagcagaat cagaactctc gccatcagaa aaagcctact tgaatgctgt ggaaaaggga 3 60 
gattatgcca gtgtcaagaa atccctagag gaagctgaaa tttattttaa aatcaatatt 420 
aattgcattg atcctctcgg aagaactgct ctcctcattg caattgaaaa tgagaacttg 480 
gagctcatcg aactactctt aagctttaat gtctatgttg gagatgctct attacatgct 540 
atcagaaaag aagtcgtcgg agctgttgag ctgttattga accacaaaaa acctagtgga 600 
gaaaaacagg tgcctcctat actccttgat aagcagttct ctgaattcac tccagacatt 660 
acaccaatca ttttggcagc ccatacaaat aattatgaga taataaaact cttggttcag 720 
aaaggagtct cagtgcctcg accccacgag gtccgctgta actgtgtgga atgcgtgtcc 780 
agttcagatg tggacagcct ccgtcactca cgctccagac tcaacatcta caaggccttg 840 
gccagtccct ctctcattgc actgtcaagc gaagatcctt ttctcacagc ctttcagtta 900 
agttgggaac ttcaggaact gagcaaggtg gaaaatgaat tcaagtcgga gtatgaagag 960 
ctgtcacggc agtgcaaaca atttgctaag gacctactgg atcagacgag aagttccaga 1020 
gaactggaaa tcattcttaa ttaccgagat gacaatagtc tcatagaaga acaaagtgga 1080 
aatgatcttg caagactaaa attggccatt aagtaccgtc aaaaagagtt tgttgcccag 1140 
cccaattgtc aacagctgct ggcatctcgc tggtacgatg agtttccagg ctggaggaga 1200 
agacactggg cagtgaagat ggtgacatgt ttcataatag gacttctttt tcctgtcttc 12 60 
tctgtgtgct acctgatagc tcccaaaagc ccacttggac tgttcatcag gaagccattt 1320 
atcaagttta tctgccacac agcctcctat ttgacttttt tgttcctgct gctgcttgcc 1380 
tctcagcaca tcgacaggtc agacttgaac aggcaaggtc caccaccaac catcgtcgag 1440 
tggatgatat taccgtgggt cctgggcttc atatggggag aaattaaaca gatgtgggat 1500 
ggcggacttc aggactacat ccatgattgg tggaatctaa tggactttgt aatgaactcc 1560 
ttatatttag caacaatctc cttgaaaatt gttgcatttg taaagtacag tgcccttaat 1620 
ccacgagaat catgggacat gtggcatccc actctggtgg cagaggcttt atttgctatt 1680 
gcaaacatct tcagttctct gcgtctgatc tcactgttta ctgcaaattc tcacctggga 1740 
cctctgcaaa tatctctggg aagaatgctc ctggacattt tgaagtttct attcatatac 1800 
tgccttgtgt tgctagcatt tgcaaatggc ctaaatcaat tgtacttcta ttatgaagaa 1860 
acgaaagggt taacctgcaa aggcataaga tgtgaaaagc agaataatgc attttcaacg 1920 
ttatttgaga cactgcagtc cctgttttgg tcaatatttg ggctcatcaa tttatatgtg 1980 
accaatgtca aagcacagca tgaatttact gagtttgttg gtgccaccct gtttggggac 2040 
attacaatgt catctctctg gttgttctac tcaacatgtt aatagctatg atgaataatt 2100 
cttaccaact gattgctgac catgcagata tagaatggaa atttgcacga acaaagcttt 2160 
ggatgagtta ttttgaagaa ggaggtactc tgcchactcc cttcaatgtc atcccgagcc 2220 
ccaagtctct ctggtacctg atcaaatgga tctggacaca cttgtgcaag aaaaagatga 2280 
gaagaaagcc agaaagtttt ggaacaatag gggtaagaac acagcatagg cgagctgctg 2340 
ataacttgag aagacatcac caataccaag aagttatgag gaacctggtg aagcgatacg 2400 
ttgctgcaat gattagagat gctaaagact gaagaaggcc tgaccgaaga gaactttaag 2460 
gaactaaagc aagacatttc tagtttccgc tttgaagtcc tgggattact aagaggaagc 2520 
aaactttcca caatacaatc tgcgaatgcc tcgaaggagt cttcaaattc ggcagactca 2580 
gatgaaaaga gtgatagcga aggtaatagc aaggacaaga aaaagaattt cagccttttt 2640 
gatttaacca ccctgattca tccgagatca gcagcaattg cctctgaaag acataacata 2700 
agcaatggct ctgccctggt ggttcaggag ccgcccaggg agaagcagag aaaagtgaat 27 60 
tttgtgaccg atatcaaaaa ctttgggtta tttcatagac gatcaaaaca aaatgctgct 2820 
gagcaaaatg caaaccaaat cttctctgtt tcagaagaag ttgctcgtca acaggctgca 2880 
ggaccacttg agagaaatat tcaatctgga atctcgagga ttagcttcat cggggtgacc 2940 
tgagcattcc cggtctcagt gaacaatgtg tgttagtaga ccatagagta aggaatacgg 3000 
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acacactggg gttacaggta ggaaagagag tcgtgtccat tcaagtcaga gaaggtgtgt 3060 
ggtggaggac acggttccta taataccaaa ggagaaacat gcaaaagaag aggactctag 3120 
tatagatcta tgatctaaac ctcccagaca cagtcaccca cgaagattac gtgaccacaa 3180 
gattgtgata cttgaaggag gaagcgttta ccatacacat acgtattttc cgtagtgctc 3240 
tgggtggggg aaaatgttta aattgtatta gcaaatgact aaattacact ttatagcgtt 3300 
taatcaagat gtggaatatt acctgtaaca tgtttaaatt aaggcaaagg caatcaaaaa 3360 
cctttttgtt ttgtagcctg cttttgcttt cacaatttgt cttacaattg tttttgttaa 3420 
taaataaatg caccttgtat tcttgtactg ttgcaataac ccacagaaac attt 3474 

<210> 39 

<211> 1613 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 00 029 0 . 1 : 2000MAY01 
<400> 39 

gtgagggctc ttgggttagt tcctgttagg ccccggccgg gggagtaggt tgaagtctcc 60 
taacgatgcc cggtgggctg cgggcaccgg gagctgtgaa gggaacgtga gggggcggcg 120 
tagtggagac ccacggcagg cc tgaagaag agcggcggcc gagcccgcct tccctgcacc 180 
atgctcatag aggatgtgga tgccctcaag tcctggctgg ccaagttact ggagccgata 240 
tgtgatgctg atccttcagc cttagccaac tatgttgtag cactggtcaa gaaggacaaa 3 00 
cctgagaaag aattaaaagc cttttgtgct gatcaacttg atgtcttttt acaaaaagaa 3 60 
acttcaggtt tcgtggacaa actatttgac agtctctata ctaagaacta ccttccactt 420 
ttggaaccag taaaagcctg agccaaaacc actagttcca agaaaaagac gaaattaccg 480 
aagaggtatt tcaggagcca gcagaggaag aacgagatgg cagaaaaaag aaatatccta 540 
gtccccagaa gactcgttca caatctactg aacgaaggac acgtgagaac aaaagagacg 600 
acgggacctt ggcgagacta tgaccggtac tatgagcgga atgaattgta ccgtgagaag 660 
tatgactgga gaagaggcag gagtcagagt cggagtaaga gtcgaggcct gagtcgcagt 720 
agaagccgaa gtagggggcg cacgcaaaga ccgggatcca aataggaatg ttgagcacag 780 
ggaaagatcg aagtttaaga gtgacaggaa tgacctggag aagttcctat gtgcctgtgt 840 
ctgcaccacc tccaaactct tctgagcagt attcctctgg ggcacagtct attcccagca 900 - 
ctgttactgt gatcgcaccc tggtcaccca ctcttgaaaa cacaacttgg gagttggtct 9 60 
tattactatt aaaaattatt agctcttcca attcttttgg tcgaaaccta ccaccaaaga 1020 
ggcgatgcag agattatgat gaaagaggat tttgtgtact tggtgacctt tgtcagtttg 1080 
atcatggaaa tgatccccta gttgttgatg aagttgctct gccaagtatg attcctttcc 1140 
caccccctcc tcctgggctt cctcctccaa caactcctgg aatgttaatg cctccaatgc 1200 
caggtccagg; cccaggcccg ggcccaggtc caggcccagg cccgggccca ggtccaggtc 1260 
ctggccatag tatgagactt cctgttcccc aaggacatgg tcagcctcca ccatccgttg 1320 
tgcttcccat accaagacca cctataacac aatcaagctt gataaacagc cgtgaccagc 1380 
ctgggacaag tgcagtgccc aatcttgcat cagtgggaac aagactacct cctcctttac 1440 
cccagaacct cctttacaca gtatcagaac gacagcccat gtactctcgt gaacatggtg 1500 
ctgctgcatc tgagcgactt cagttgggga caccgcctcc tctgttggca gctcgtttgg 1560 
tgccacctcg aaacctcatg ggatcctcca ttggatacca tacctcagtc tec 1613 

<210> 40 

<211> 1056 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 023518 . 3 : 2000MAY01 
<400> 40 

ccagaggaaa ctagtcacaa aaaccctgac tatcacctga tagattgett gtgctgcctg 60 
ataattactc gcacttttcc caggctagtg caaatcttca ggggccgtcc aggactacag 120 
agctgtttca ccctaccttg gcttcaatct cttcccccat gctcgaaggt gcggagctgt 180 
acttcaacgt ggaccatggc tacctggagg gcctggttcg aggatgeaag gccagcctcc 240 
tgacccagca agactatatc aacctggtcc agtgtgagac cctagaagct ccattcttcc 300 
aagactgeat gtctgaaaat gctctagatg aactgaatat tgaattgeta cgcaataaac 360 
tatacaagtc ttaccttgag gcattctata aattctgtaa gaatcatggt gatgtcacag 42 0 
cagaagttat gtgtcccatt cttgagtttg aggecgacag acgtgctttt atcatcactc 480 
ttaactcctt tggcactgaa ttgagcaaag aagaccgaga gaccctctat ccaacctttc 540 
ggcaactcta tcctgagggg ctgcggctgt tggctcaggc ggaagacttt gaccagatga 600 
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agaacgtagc ggatcattac ggagtataca aacctttatt tgaagctgta ggtggcagtg 660 
ggggaaagac attggaggac gtgttttacg agcgtgaggt acaaatgaat gtgctggcat 720 
tcaacagaca gttccactac ggtgtgtttt atgcatatgt aaagctgaag gaacaggaaa 780 
ttagaaatat tgtgtggata gcagaatgta tttcacagag gcatcgaact aaaatcaaca 840 
gttacattcc aattttataa cccaagtaag gttctcaaat gtagaaaatt ataaatgtta 900 
aaaggaagtt attgaagaaa ataaaagaaa ttatgttata ttatctagac tacacataag 960 
taagccacac tatatcttca tgagttgcaa atccatggaa acacagtaaa ccaggcctga 1020 
aacaaagcat ttccttggtt tcagtggtat tagatc 1056 

<210> 41 
<211> 3806 
<212> UNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 1084246 . 1 : 2000MAY01 
<400> 41 

cgttacaagc agtgcaggtt taccaacggc ttggggcagc gatatactaa acaaatttaa 60 
tttaaaagca actgtgtgac gattcctcca agcaagaaat tggaattgaa tgtctcaagt 120 
ctcgttgcgg ttgctgaggg gattggatat agggacctgg actccaacat gaagaagcta 180 
gggagaattc atccaaacag gcaagtgttg gcctttattt tgatggtgtt cttgtctcag 240 
gttcgcctcg agcctattcg ttattctgtg ttggaggaaa cagagagcgg ctcctttgta 300 
gcccatctgg ccaaggatct gggcctggga attggggaac tggcctcccg gtcagcccgg 3 60 
gtgctgtctg acgatgacaa gcagcgtttg cagctggatc gtcagactgg agatttgctt 420 
ctgagggaga aactagaccg ggaagagctc tgtggtccta fctgaaccgtg tgtactgcat 480 
ttccaagtgt tcctggaaat gccggtgcaa ttttttcata ggagaattat tgatccagga 540 
tcatatatgt atcactctcc aatattccct gaaagggaag tgctcttgaa aatactagaa 600 
aatagtccag ccgggtactc tatttccgtt gctaatagct gaggatttgg atgtgggcag 660 
caatggtctt caaataatac acaatcagcc ccaattctca ttttcacatt ctcactcgaa 720 
atcatagtga gggcaagaaa tacccagatt tggtgcagga caaaccacta gatcgagagg 780 
agtcagcctg agttacagct taaccctcgt ggcgctggat ggtgggtcac cacctaggtc 840 
tggcacggtc atggttcgaa tcctgatcat ggacatcaat gacaatgctc ctgagtttgt 900 
gcacactcca tatggggtgc aggtcctgga aaacagcccc ctagactctc caattgttag 9 60 
ggtcttagct agagatatag atgctggaaa cttcgggagt gtttcttatg gcttattcca 1020 
agcatcagat gaaattaaac aaactttctc aataaatgaa gtcacgggag aaatactgtt 1080 
gaaaaaaaaa ttggatttcg aaaaaattaa atcttaccat gtagaaattg aggccacaga 1140 
tggaggaggc ctttctggaa aaggcactgt agtcatagag gtggtggatg tgaatgacaa 1200 
tcccccagaa cttatcatat cttcactcac cagctccatc ccagaaaatg ctcctgagac 1260 
ggtagtctct atcttccgaa ttcgagatag agattccgga gaaaatggaa agatgatttg 1320 
ctctattcca gataatctac cgtttattct aaaaccaact ttgaagaatt tttacaccct 1380 
ggtaacgggg gtgaccactg gaccgagaga ccagcactga gtacaacatc accatcgccg 1440 
tcactgactt ggggacaccc aggctgaaaa cccagcagaa cataaccgtg caggtctccg 1500 
acgtcaatga caacgccccc gccttcaccc aaacctccta caccctgttc gtccgcgaga 1560 
acaacagccc cgccctgcac atcggcagtg tcagcgccac agacagagac tcgggcacca 1620 
acgcccaggt cacctactcg ctgctgccgc cccaggaccc gcacctgccc ctcgcctccc 1680 
tggtctccat caacgcagac aacggccacc tgttcgccct caggtcgctg gactacgagg 1740 
ccctgcaggc gttcgagttc cgcgtgggcg cctcagaccg cggttctccg gctttgagca 1800 
gcgaggcgct ggtgcgcgtg ctggtgctgg acaccaacga caactcgccc ttcgtgctgt 1860 
acccgctgca gaatggctcc gcgccctgca ccgagctggt gccccgggcg gccgagccgg 1920 
gctacctggt gaccaaggtg gtggcggtgg acggcgactc gggccagaac gcctggctgt 1980 
cgtaccagct gctcaaggcc acggagcctg ggctgttcgg cgtgtgggcg cacaatggcg 2040 
aggtgcgcac cgccaggctg ctgagcgagc gcgacgcagc caagcacagg ctcgtggtgc '2100 
ttgtcaagga caatggcgag cctccgcgct cggccaccgc cacgctgcac gtgctcctgg 2160 
tggatggctt ctcccagccc tacctgcctc tccctgaggc ggccccggcc caggcccagg 2220 
ccgactctct caccgtctac ctggtggtgg cgttggcctc ggtgtcgtcg ctcttcctct 2280 
tctcggtgct cctgttcgtg gcggtgcggc tgtgcaggag gagcagggcg gcctcggtgg 2340 
gtcgctgctc ggtgcccgag ggcccctttc cagggcatct ggtggacgta agcggcaccg 2400 
ggaccctgtc ccaagagcta ccagtacgag gtgtgtctga caggagactc tgggactggt 2460 
gagttcaagt tcctgaagcc aatatttcct aatctcttgg ttcaggacac cggggaggga 2520 
agttaaggaa aacccccaag ttcagaaata gcttggtatt cagttaagta ttgtatttag 2580 
ttcagtgaac cgcccgttaa gttttgtcaa acttcccact ggcaatgcct ttatttaaaa 2640 
aaattgtcta cttatctgaa atattcatac cacaatttca aacctactca tgtccctgat 2700 
aaagctaaat ttgtcccttt tttattgtta ttaattgcac ttaacatttt tagttatact 2760 
ggatattgag tatggatttt ctctatattt gatctattgg tgattaatct ttttgtaatc 2820 
ataaattact caattaggat aaaaataaat tatgttttaa tgaaattctt aaattaacat 2880 
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ctttttaatg gaacatttaa gtgaatatat gaatattgaa tttctaaata tttgttgtgc 2940 
ctgtctttac catgtaactt aatgtttgca aggccagagt gtttgaaagt tttgtattta 3 000 
actttataat taccttgtcc tttctggttg actatactag gctaagccct cttaatagcc 3 060 
atgagtgtaa aatttagttt actcattttt cacaaattgt aaattaacat ggcacttcac 3120 
tacattggta atacactaaa attgtggtcc ttttcctctt gtgaccacca catgtctagt 3180 
gattattttg tttatttggt tgctacttac ctagcacatt gtaatgttcc atgaatgcta 3240 
atattaaatt ttgtaaaaat aacttattta taaataattt ttaaagagaa aaatctcata 33 00 
taatttgtca taacctttca ataaataaaa ctgttaaatc atgggcctga tatcatctta 33 60 
aaaaaaaatc ctcagaatct gaaataagcc ctaaatttct ccccaaaatc aagactcttg 3420 
agagcatcat aggtctcctt gtgctacctt ttactcccta taaatagaaa tccaagtata 3480 
ctttaatatg tgtatatttt ttggttttcc tacagcttct ccccatcttt caaaagaatc 3540 
acgaaatttc ttctgcacct tggctattct gtttaaatct gataatcagt tgatctcagg 3 600 
tttttcactg tacattactt tgcagatatg gacagccttt acaaaaataa tttttaaatg 3 660 
cttaattatt ttaatttgtt ctttaaggta accttcagtt attttgtatt aatttaactt 3720 
ctcaattatg ccaaagttgc acttgcatga aataaatatt . attttgtcct tgtatagact 3780 
ggaacagtaa taaatttatc tgaatt 3806 

<210> 42 
<211> 623 0 
<212> DMA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 1165828 . 1 : 2000MAY01 
<220> 

<221> unsure 

<222> 4042, 4725 

<223> a, t, c, g, or other 

<400> 42 

ctcgcttttc ttgcaatatt ttataccttt tcaattcata gaattactca agaaaactac 60 
ctcagttggt tgctactttt tgttgattcc ttttaccaga catgactaag tttctttttc 120 
atcagtagat ttctgggctc ctatafctcac tagagattgc aactcctgga tttctcttac 180 
actagaatcc tatttcgagc catatgggag attctgaatt ccagaacaaa agaattttgt 240 
aatttaaaat tcgtgattgc tcaatggaat cattttaatt gttacttcat ttctgtcgtt 300 
atttaaaact taagtggaga gttttctcag ggataagaaa accacaatca aggtcataca 360 
aaacttttag aggcagtcag tctgctaaga aggctccagc aagagaaacg ggatcttctg 420 
tttcaacaat cattacttaa gaaaaaatta agaaaatgaa ataagttttg cagaataact 480 
gtgaaatttt tattcatgaa atatgtactt acactttggg ccacgtgatg tcactctttg 540 
ccgcgatgtt ctctctgaat ccagacaaat acagcccttt tcccatggga aagaggctca 600 
attctttttc actctctctg tgctgaacga tggcgaacac agcagaatgg gactgacgaa 660 
atcagatgat ttcttctaat ttggaggcaa ttttcactaa ttagaagaag actgagtatt 720 
tgaaatgtta tactcaagtc gaggagatcc agagggtcag cctctactgc tctcgcttct 780 
gatcctcgca atgtgggtgg tggggagcgg ccagctccac tactccgtcc cggaggaagc 840 
cgaacacggc accttcgtgg gccgcatcgc gcaggacctg gggctggagc tggcggagct 900 
ggtgccgcgc ctgttccagt tggattccaa aggccgcggg gaccttctgg aggtaaatct' 960 
gcagaatggc attttgtttg tgaattctcg gatcgaccgc gaggagctgt gcgggcggag 1020 
cgcggagtgc agcatccacc tggaggtgat cgtagacagg ccgctgcagg ttttccatgt 1080 
ggacgtggag gtgaaggaca ttaacgacaa ccctccagtg ttcccagcga cacaaaagaa 1140 
tctgttcatc gcggaatcca ggccgcttga ctctcggttt ccactagagg gcgcgtccga 1200 
tgcagatatc ggggagaacg ccc tgctcac ttacagactg agccccaatg agtatttctt 1260 
cctggacgtg ccaaccagca accagcaggt aaaacctctt ggacttgtat tacggaaact 1320 
tttagacaga gaagaaactc cggagcttca tttattgctc acggccaccg atggaggcaa 13 80 
acccgagctg actggcaccg ttcaattact catcacggta ctggacaaca atgacaatgc 1440 
cccagtgttc gacagaaccc tgtatacggt gaaattacca gaaaacgttt ctatcggaac 1500 
gctggtgatt caccccaatg cctcagattt agacgaaggc ttgaatgggg atattattta 1560 
ctccttctcc agtgatgttt ctccagatat aaaatccaag ttccacatgg accccttaag 1620 
tggggcaatc acagtgatag gacatatgga ttttgaagaa agtagagcac acaagatccc 1680 
agtcgaggct gtcgataaag gcttcccacc cctggctggt cattgtacac ttcttgtgga 1740 
agttgtggat gtaaatgaca atgctccaca gttgactatc aaaacgctct cggttcctgt 1800 
aaaagaggac gcacaactgg ggacagttat tgccctgatt agtgtgatcg acctagacgc 1860 
agatgccaac gggcaggtga cctgctccct gacgccccac gtccccttca agctggtgtc 1920 
cacctacaag aattactact cgttggtgct ggacagagct ctggaccgcg agagtgtgtc 1980 
cgcctacgag ctggtggtta ccgcgcggga cgggggctcg ccttcactgt gggccacggc 2040 
cagggtgtct gtggaggtgg ccgacgtgaa cgacaacgca ccagcgttcg cgcagtccga 2100 
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gtacacggtg ttcgtgaagg agaacaaccc 
gcgggacgct gacgcgcagg agaacgccct 
gggcgagcgc tcgctgtcga gctacgtgtc 
gctgcagccg ttggaccacg aggagctgga 
cgcgggcgtg ccgcctctgg gcagcaacgt 
cgacaatgcg ccggcgctgc tgacacctcg 
gatggtgctg cggtcggtgg gcgccggcgt 
cgactcgggc tacaacgcgt ggctttcata 
catcccgttc cgcgtggggc tgtacacggg 
aacggacgca ccgcgccagc gcctactggt 
gacggccacg gccactgtgc tggtgtcgct 
gtcgcgggcg tcagtgggtg ccacgggccc 
cctgatcatc gccatctgcg cggtgtctag 
tgtgctgcgg tgctcggcga tgcccaccga 
tggtgtgttc tagcgcggtg gggagttggt 
tgctctggcg agggtaagca gaagaccgac 
ttgtgctggg atctacagag cgaacgggag 
agccacgaca gcccaaccct gactggcgtt 
gctctgtgca cctagaggag gctggcattc 
cagtggccaa ccagtatcca gtgcaacacc 
cccagtcggt gcgggtgtca acagcaacag 
ccaaacaatc cgagtcccgg tgagttgccc 
aatcatctcc atccggcagg agcctactta 
cccttcggca aaaaggagga gacccagaaa 
caggagaaaa aagagaaagg gaacagcacg 
caaatgggaa acaagccact tagccagttt 
aggcaattgc cctgctcctt gtttcctatc 
taatctgcag ataagttccc tggtgtctgt 
taaaaagctt tactaagtct ggttgttaac 
aacctataaa gagcagaccc agagtgtgtc 
cttcccagcc ccgccagtga gaggtgtgga 
ttcgatgaca cttgcatggg caggctgaaa 
tggcccaccc tggggggtta anttgttgct 
gagtgccaga tattggctga gaccgagcca 
aagataacac aacgacaaat aaaacagcgg 
ctctaaaggc gaccagacct ttcatagaat 
tcttaggaga caaaacgtac tcgcccacca 
ttggctagtg acgtcattat acctaaaatc 
ttcaggtgtt taaacagaga atacaccgct 
ctatacatgt gcatgtgctc actttattaa 
aagggccaga tcctttttcc aatacttatg 
cacctttccg ctgtttgttg tttcactagg 
tataaagtct ttaaagagaa atatgaatac 
ttcaggtcta caacgggcgc agtttaaaat 
ctcttataga gaattgcctg aaacatctgt 
tttactcttt caggtcatct ctggggctgc 
tctctctttc tctcgtctcg tctctctttt 
ccaccaaccc ttctctaacc caacctatat 
tggttgttct catacaggtg gagcagattt 
tctgggtggt gctagccata caccttcgtc 
cctcctgaat tgtctaattc ttaactaacc 
ctcccacata tgtatggctg ttatggctat 
ctcgtgcttg tgtaatagtg aaaggtaata 
tttattgttg aaagtgaata tcccttataa 
atggagtgag tggttttttt aacccttggt 
gtaaaatttc ttttttaaat ccagatactg 
cccagagcca tctcgtgccc agacttctgc 
ttcttagtaa caattttgga atgaatactg 
aattttacca atctgacctc ttgtgaagtt 
tcctgaaata tcagctcata ggaaagtacc 
ttaattttgg ttataatgta caatttagaa 
attaatataa aagaggtagg agtctgttat 
ctgtctgtgt ctacttttag cttcattctc 
agctctgcag gattgccatg gggtaaaact 
cattgtaggt tgtgatcatt ttggccccac 
gcccttttga actaggagaa tcgggctaat 
tacagcactt tttacatttg cgaagtgcct 



gccgggctgc cacatcttca cggtgtctgc 2160 
ggtgtcctac tcgctggtgg agcggcggtt 2220 
agtgcacgcg gagagcggca aggtgtacgc 2280 
gctgctacag ttccaggtga gcgcgcgcga 2340 
gacgctgcag gtgttcgtgc tggacgagaa 2400 
gatgaggggc actgacggcg cagtgagcga 2460 
agtggtgggg aaggtgcgcg cagtggacgc 2520 
cgagctgcag ccagaaacgg ccagcgcgag 2580 
cgagatcagc acaacgcgtg ccctggacga 2640 
gctggtgaaa gaccacgggg agccagcgct 2700 
ggtggagagc ggccaggcgc caaagtcatc 2760 
cgaggtgacg ctggtggatg tcaacgtgta 2820 
cctgttggtt ctcacgctgc tgctgtacac 2880 
gggcgagtgc gcgcctggca aggccgacgc 2940 
cgtactcgca gcagagggag gcagagggtg 3000 
ctcatggcct tcagcccggg cctttcttcc 3060 
aaccctctgc ttcctcagat tcaactggga 3120 
actctgcctc cctgagagca ggcatgcaca 3180 
tacgggctgg tccaggaggg gcctgatcag 3240 
cagaacccag aggcaggaga agtgtcccct 33 00 
cgtggacctt taaatacgga ccaggcaacc 33 60 
gacaaattca ttatcccagg atctcctggc 3420 
cagccacaat tgacaaaagt gacttcataa 3480 
aagaagaaaa agaagaaggg ttaccagacc 3540 
acgtgacaac cagtgaccac gtgaggtcct 3600 
tttgtaataa tgggcaaatc tctcccatgt 3660 
tacattgagc cctcttagag acccgtcaga 3720 
gctagaacgg catttaacac gtttttgtcg 3780 
tctttctctc cactctggct gtgttttcag 3840 
ctgttgctcc tccggccgca ataggagagg 3 900 
ctctctgccc tgtgctccgg ggatcctgtc 39 60 
agttttgaga ttgagcagct tgggagtttg 4020 
tttgggctaa ccccggcggg ggtaattgcc 4080 
gcttagacta attgggtaca agggaaaggc 4140 
aagttatcag tatggagggg aaagtgtaaa 4200 
ccttacaact caagaggtgg cagccacctc 4260 
acaagactat taggagacca ctaaaatctg 4320 
tggcattcat rtacctggcaa ggccaaacag 4380 
gggaaacaga agcagatctg atgtgattcg 4440 
aaattctttt gcacacaatt gtttatggaa 4500 
gcaaaagcaa aagaaaaccc cggacacctt 4560 
atttatttaa aaaaagagaa agtctatagc 4620 
aattccccta aactctgcct caaaagagaa 4680 
ttggactcac ttggnctgct acacgaagtg 4740 
attatatcgg ccaccctgcc caatcacagc 4800 
cctcttgaca tgtattacta aataaaatga 4860 
ctaagaaacc aattatgtgc acctttgata 4920 
atccagaccc caaaaattga agaaaaatat 4980 
ctgcaatcta cttaattctg gtggacttgg 5040 
gtttggttta gttttccctt tctaaaacca 5100 
accctatgaa tgttaccccg agaatcccat 5160 
gcttagactc cctggaataa taacttactt 5220 
gccactatta cctcagagtg aactttaagc 5280 
tattcccttt gtgacaacct cgtggaaaaa 5340 
aatacagact tttgtgtatg aaagacccca 5400 
gtgattcaag gaattttatt tatggtccag 5460 
tggcaagggg agtggataaa gctgttttgg 5520 
acaatattcc atgaagggtg tgcaagcaca 5580 
gcagtaatgc tttgaaattt ctaatgggta 5640 
aaaatttgct gtcaccttaa ataagacatt 5700 
agtttgatta attatattat ctatttaggc 5760 
ttaaaaaaag ccatttaatt taaaaaaaaa 5820 
ccatattttg gaagggtgtg taaactttca 5880 
tgttacccaa cacatgtgaa ccatttgcta 5940 
tgaagcccca tgtatcctga cccttaacgt 6000 
ttattaatga tgataattat aatgtatctg 6060 
ttccaatcca tgttagttac tagttattac 6120 
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cagctgtaaa ggagttaaac acctcaagtg gaatcatttt gaaattggtg ctaattggta 6180 
tttcctcctg ttatctgcta ataaatgaaa aatggtggta tgaaaaaaaa 623 0 

<210> 43 

<211> 2940 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 007302 . 1 :2000MAY01 
<400> 43 

aagaatttgg actcatatca agatgctctg aagaagaaca accctttagg atagccactg 60 
caacatcatg accaaagaca aagaacctat tgttaaaagc ttccattttg tttgccttat 120 
gatcataata gttggaacca gaatccagtt ctccgacgga aatgaatttg cagtagacaa 180 
gtcaaaaaga ggtcttattc atgttccaaa agacctaccg ctgaaaacca aagtcttaga 240 
tatgtctcag aactacatcg ctgagcttca ggtctctgac atgagctttc tatcagagtt 3 00 
gacagttttg agactttccc ataacagaat ccagctactt gatttaagtg ttttcaagtt 3 60 
caaccaggat ttagaatatt tggatttatc tcataatcag ttgcaaaaga tatcctgcca 420 
tcctattgtg agtttcaggc atttagatct ctcattcaat gatttcaagg ccctgcccat 480 
ctgtaaggaa tttggcaact tatcacaact gaatttcttg ggattgagtg ctatgaagct 540 
gcaaaaatta gatttgctgc caattgctca cttgcatcta agttatatcc ttctggattt 600 
aagaaattat tatataaaag aaaatgagac agaaagtcta caaattctga atgcaaaaac 660 
ccttcacctt gtttttcacc caactagttt attcgctatc caagtgaaca tatcagttaa 720 
tactttaggg tgcttacaac tgactaatat taaattgaat gatgacaact gtcaagtttt 780 
cattaaattt ttatcagaac tcaccagagg tccaacctta ctgaatttta ccctcaacca 840 
catagaaacg acttggaaat gcctggtcag agtctttcaa tttctttggc ccaaacctgt 900 
ggaatatctc aatatttaca atttaacaat aattgaaagc attcgtgaag aagattttac 960 
ttattctaaa acgacattga aagcattgac aatagaacat atcacgaacc aagtttttct 1020 
gttttcacag acagctttgt acaccgtgtt ttctgagatg aacattatga tgttaaccat 1080 
ttcagataca ccttttatac acatgctgtg tcctcatgca ccaagcacat tcaagttttt 1140 
gaactttacc cagaacgttt tcacagatag tatttttgaa aaatgttcca cgttagttaa 1200 
attggagaca cttatcttac aaaagaatgg attaaaagac cttttcaaag taggtctcat 1260 
gacgaaggat atgccttctt tggaaatact ggatgttagc tggaattctt tggaatctgg 1320 
tagacataaa gaaaactgca cttgggttga gagtatagtg gtgttaaatt tgtcttcaaa 1380 
tatgcttact gactctgttt tcagatgttt acctcccagg atcaaggtac ttgatcttca 1440 
cagcaataaa ataaagagcg ttcctaaaca agtcgtaaaa ctggaagctt tgcaagaact 1500 
caatgttgct ttcaattctt taactgacct tcctggatgt ggcagcttta gcagcctttc 1560 
tgtattgatc attgatcaca attcagtttc . ccacccatcg gctgatttct tccagagctg 1620 
ccagaagatg aggtcaataa aagcagggga caatccattc caatgtacct gtgagctaag 1680 
agaatttgtc aaaaatatag accaagtatc aagtgaagtg ttagagggct ggcctgattc 1740 
ttataagtgt gactacccag aaagttatag aggaagccca ctaaaggact ttcacatgtc 1800 
tgaattatcc tgcaacataa ctctgctgat cgtcaccatc ggtgccacca tgctggtgtt 1860 
ggctgtgact gtgacctccc tctgcatcta cttggatctg ccctggtatc tcaggatggt 1920 
gtgccagtgg acccagactc ggcgcagggc caggaacata cccttagaag aactccaaag 1980 
aaacctccag tttcatgctt ttatttcata tagtgaacat gattctgcct gggtgaaaag 2040 
tgaattggta ccttacctag aaaaagaaga tatacagatt tgtcttcatg agaggaactt 2100 
tgtccctggc aagagcattg tggaaaatat catcaactgc attgagaaga gttacaagtc 2160 
catctttgtt ttgtctccca actttgtcca gagtgagtgg tgccattacg aactctattt 2220 
tgcccatcac aatctctttc atgaaggatc taataactta atcctcatct tactggaacc 2280 
cattccacag aacagcattc ccaacaagta ccacaagctg aaggctctca tgacgcagcg 2340 
gacttatttg cagtggccca aggagaaaag caaacgtggg gctcttttgg gctaacatta 2400 
gagccgcttt taatatgaaa ttaacactag tcactgaaaa caatgatgtg aaatcttaaa 2460 
aaaatttagg aaattcaact taagaaacca ttatttactt ggatgatggt gaatagtaca 2520 
gtcgtaagta actgtctgga ggtgcctcca ttatcctcat gccttcagga aagacttaac 2580 
aaaaacaatg tttcatctgg ggaactgagc taggcggtga ggttagcctg ccagttagag 2640 
acagcccagt ctcttctggt ttaatcatta tgtttcaaat tggaaacagt ctcttttgag 2700 
taaafcgctca gtttttcagc tcctctccac tctgctttcc caaatggatt ctgttgtgag 2760 
caagagttta tatggcttca tggcagcaag ggaacagtca acttcagcat catatgcacc 2820 
agtcctcgga gtgccctgtg aatcatattg gtctttgggt cagtgtcatc attctcttca 2880 
agtctggggc ttggggaaaa aattagatca gctacggcat ataaaaaagt cttttgtttc 2940 

<210> 44 

<211> 4438 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 23 6386 . 4:200 0MAY01 
<220> 

<221> unsure 
<222> 3774 

<223> a, t, c, g, or other 
<400> 44 

taagcctcag tccttgtttt cccggcctgg ctcgttgtga agccggacac atccaccctt 60 
ggactcgatt caggcggctg ctgcttttct ccttgcccct cttggatttt ccggattttt 120 
gaaaacccag tggcccagga gcaagaggag gaaggaggaa ggggcagatc tgcagaggaa 180 
tgtgagagcc tcccaaagcg agagccgcca aaagaatctg ggagccagag ggacatccga 240 
gccctgcccg ggtttctgga atggtggttt cagagtgagt ctcttctatt ttagaacgtt 300 
gttccagtgg aaagtgtcga atttttcccc tcgcagggca gatttctcca ggtcacttga 360 
cttttcttct gggagtagga gttaggagag attcccctct aaccccccag aggctgctaa 420 
gggaggagga gactgtggac atgagccctc cctgctcaca agcatatgcc cggagacctg 480 
atagggcagt ttctgggcca tggacattgc tttgaagagg gggagactgg acagcatctg 540 
tgggtgctga gaccccacct taggacctga gagattgaac tgtgtaagcg ccattcagct 600 
gcgagtgcat tcttggactg ccttgtgagc atccccggtc tgggcaggac cctctccttc 660 
ccatctttct ataccaccca gcccagccat ggcactgaaa ggccgagccc tctatgactt 720 
tcacagtgag aacaaggagg aaatcagcat ccagcaggat gaggacctgg tcatctttaa 780 
cgagaactca cttggattgg ttggcttgca gggccaaaac agccgtgggg agacagggct 840 
ctttcctgcc tcttatgtgg agatcgtccg ttctggcatc agcaccaacc atgctgacta 900 
ctccagcagc cctgcaggct ctcccggagc ccaggtgagc ttgtacaaca gccccagtgt 960 
ggccagccca gctaggagtg gtgggggcag tggcttcctc tcaaaaccag ggtagctttg 1020 
aggaggatga tgatgatgac tgggatgact ggtgacgacg gatgcacagt ggtggaggag 1080 
ccacagggct ggtggggctg gcgcacacaa cggggcaacc cgtcccctca accgtgtcct 1140 
agcatggggc cctaccccca gcccagcaca atgcccttcc ggcccaagcc aacaatgtga 1200 
ggcggcagga cagcctggca tctgccaagg cgaggcagtg tggtgggcca gtaacactca 1260 
accgtttctc atgctttgtg cgttctggaa tggaagccct taatcctggg tgatgtgccc 1320 
atgatggcac aagatcgctg agacatactc cattgaaatg ggccctcgtg gcccccagtg 1380 
tgaaggcgca atccccaccc atttgcctgc tctgtggagg accccacaaa acagaccaaa 1440 
ttcaagggca tcaaaagcta catctcctac aagctcacac ccacccatgc tgcctcaccc 1500 
gtctaccggc gctacaaaca ctttgactgg ctctataacc cgcctgctac acaagttcac 1560 
tgtcatctcg gtgccccacc tgcctgagaa gcaggccact ggccgcttcg aggaggactt 1620 
catcgaaaag cggaagcgga gactcatcct ctggatggac cacatgacca gccaccctgt 1680 
gctctcccag tacgaaggct tccagcattt cctcagctgc ctggatgaca agcagtggaa 1740 
gatgggcaaa cgccgggcgg agaaggatga gatggtgggt gccagcttcc tgctcacctt 1800 
ccagatcccc accgagcacc aggacttgca ggacgtggaa gatcgcgtgg acactttcaa 1860 
ggccttcagt aagaagatgg acgacagcgt cctgcagctc agcactgtgg catcagagct 1920 
ggtgcgtaaa catgtggggg gcttcccgca aggaattcca gaacgctggg cagtgccttc 1980 
caggccatca gtcattcctt ccagatggac cccccctttt gctctgaggc cctcaacagt 2040 
gccatttctc acacgggccg tacctatgaa gccatcgggg agatgtttgc tgagcagccc 2100 
aagaatgacc tcttccagat gctggacaca ctgtctctct accagggcct gctctccaac 2160 
ttccctgaca tcatccatct acaaaaaggc gccttcgcca aggtgaagga gagccaacgc 2220 
atgagtgacg agggccgcat ggtgcaggac gaggcagacg gcattcgcag gcgctgccgc 2280 
gtggtgggtt tcgccctgca ggccgagatg aaccacttcc accagcgccg tgagctcgac 2340 
ttcaagcatc atgatgcaga actacttgcg ccagcagatc ctcttctacc agcgggtggg 2400 
ccagcagctg cataagaccc tgcgcatgta tcacaccctc tgaccgcgtg tgcctgggct 2460 
ccctccttca cctgggcctg gtcactgcag tgtactccac tttcacgacc accctatgcc 2520 
agcagtgact gatgaattgg tcagcggtgg cggagataac cggcctgtcc tgcctcctgg 2580 
tagaaggagc tttcaaggag tcatgggtgc ccctgggaaa ttccccactc cttagaagtg 2640 
gggcacagca ggggtgagaa tagagtcagg agccctcgag gccaaggcct gggctgccgg 2700 
tcagtccagt gaaggtcagg ccagggtctc agcctcccct agagcctatt ttgcttgctc 2760 
acctggcgca ctgtgtgcct tatccattca gcagacaccg aggcctgctg cacccttggg 2820 
tcggatgctg ggcaccccag ggctgtgaca tgcctgcctc ttcaggagtc ctcaagtgaa 2880 
ggtcggggtc agacacagac agagtcaact gcagtactga ctgactgctt taaatgacgg 2940 
gatttttgga agctctatag aagggaccac agctattcca ctggtcaggg tagactccat 3 000 
agagtaggct acatttgggg cagtgttttg aagaatctag caaggaccta ggcccagaca 3060 
gtacatgcgg gacgaagaga cttctaccgg gagaggaaca gcatgaggcc aaagttatgg 3120 
agggcttgca aacttctccc tcttctctcc ccttactttc caaggcaagt taggtgacgc 3180 
tttccatggg gattctcggc ctgtgtggta aggaacgagg atctcccttg ctccccatgt 3240 
agctggtctg tccgtgacat caccctgtcc cctgcaggag ggggctacag gccatctccc 3300 
ttcctgtagg cctctgactc ccctcccact tttggggccc tcagcttatc tcgcgcatgg 33 60 
ggaccattcg cagcatcctc gccctcctgc ggacfccaaga tccatgagat ataagccctg 3420 
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ggccccagat ccctggtgac accttccttg gagaagactc tcaaaagtga ctgtatattt 3480 
gagttcacca gcaataactc cccacacttc gaagcatggt ccaaacccat ggatcccagg 3540 
gtccttgggc ctctgtgggc actgtcttcc caagatcctt cctgttgcaa caatgggaaa 3 600 
ccttaagagg aaaaagacag gggcctgctt tgcccagccc atgcgaaggg attccatgcc 3 660 
cacctgccct ctgcctgcct cgctggaatg tgggcccctg ctcccccgtc agggtggtgc 3720 
tgtctctgac ctatgtttac gatccccgag gggtttttgg cttccccttc ccanccaggt 3780 
cagggtgtgg ttccagcagc ttgctgtggg gtgctgacat gtgtcaccac tgcccccctt 3840 
gtcccccggg ggggtcatgg tctcctcctg gatgctgctc cttgaatctt ttttcttgat 3 900 
aaacctttta caattaagat aacacaagca tgactttttc tgtttggatc ccagaaaggc 3960 
ggagggcagg agaaggatag agccctaatt gctcctgaga gccattggat gagattctga 4020 
ggtcgtggtg ggcacaaatt ttccacagaa cctcaaaagt tcaggggagg gctatgctgg 4080 
tggaaggtgc cagcaggcag gaggagctag aggcggctgt ggacccctgg gtggatccat 4140 
ccctccctag aacgcactct tgtctctaaa acaggtggag tgctgcccag gggactggct 4200 
gtactgcctt gtgatctggg gctgagggtt gtatgaggaa gggacaggac gctgtgccct 4260 
aggacaatta atagatggtg gctcctctcc ccaaggagcc atgccctggc cttgcccttg 4320 
aaaagcccta gtccagggga gggaagtggg ggactcagaa gctgtgtctc ttccccaaac 4380 
cgtcctgggt acccagccct gcggaggtcc cacattggaa ctgaagagga cgctggct 4438 

<210> 45 

<211> 987 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 252904 . 5 :2000MAY01 
<400> 45 

cccgacttca gccccagcca gatcccgcgt caacggaggc ggaacggcgg accccgtacc 60 

ctggcagcat cggagcaccg gcgggtgaag gcaaggtccc tggactggtc atatacctct 120 

tgtggccctg gcagaatcaa gatgaggccc tgtcatgcct ccccagtgag gcctacagtc 180 

tgagcagaca gcatggcctg ccactggcag tgaacaccat gtctgcagga ggtggccggg 240 

cctttgcttg atggtatggt gtatgctctg gggggaatgg gccctgacac ggccccccag 300 

gcccaggtac gtgtgtatga gccccgtcgg gactgctggc tttcgctacc ctccatgccc 3 60 

acaccctgct atggggcctc caccttcctg cacgggaaca agatctatgt cctggggggc 420 

cgccagggca agctcccggt gactgctttt gaagcctttg atctggaggc ccgtacatgg 480 

acccggcatc caagcctacc cagccgtcgg gcctttgctg gctgcgccat ggctgaaggc 540 

agcgtcttta gcctgggtgg cctgcagcag cctgggcccc acaacttcta ctctcgccca 600 

cactttgtca acactgtgga gatgtttgac ctggagcatg ggtcctggac caaattgccc 660 

cgcagcctgc gcatgaggga taagagggca gactttgtgg ttgggtccct tgggggccac 720 

attgtggcca ttgggggcct tggaaaccag ccatgtcctt tgggctctgt ggagagcttt 780 

agccttgcac ggcggcgctg ggaggcattg cctgccatgc ccactgcccg ctgctcctgc 840 

tctagtctgc aggctgggcc ccggctgttt gttattgggg gtgtggccca gggccccagt 900 

caagccgtgg aggcactgtg tctgcgtgat ggggtctgaa ggcttggtgg agctgtccac 960 

tgagcagctc attggggatc cactagt 987 

<210> 46 
<211> 263 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 977683 . 1 . orf 3 : 2000FEB18 
<400> 46 

Gly Ser Asp Met Ala Ala Asp Leu Asn Leu Glu Trp lie Ser Leu 
1 5 10 15 

Pro Arg Ser Trp Thr Tyr Gly lie Thr Arg Gly Gly Arg Val Phe 

20 25 30 

Phe lie Asn Glu Glu Ala Lys Ser Thr Thr Trp Leu His Pro Val 

35 40 45 

Thr Gly Glu Ala Val Val Thr Gly His Arg Arg Gin Ser Thr Asp 

50 55 60 

Leu Pro Thr Gly Trp Glu Glu Ala Tyr Thr Phe Glu Gly Ala Arg 

65 70 75 

Tyr Tyr He Asn His Asn Glu Arg Lys Val Thr Cys Lys His Pro 
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80 


Val 


Thr 


Gly 


Gin 


Pro 








95 


Glu 


Gin 


Thr 


Val 


Ala 
110 


Pro 


He 


Ser 


Met 


lie 
125 


Asp 


Tyr 


Ala 


Val 


His 
140 


Ala 


Ser 


Lys 


Lys 


Val 

tec 

155 


Lys 


Arg 


Asn 


Pro 


Asn 
17 0 


Lys 


Gin 


Asp 


Ser. 


Thr 
185 


Val 


Leu 


Ser 


Asp 


Leu 








200 


Glu 


Gly 


He 


Leu Gly 










215 


Phe 


Ala 


Tyr 


Pro 


Leu 








230 


Arg 


Gin 


Pro 


He 


Gin 
245 


Gin 


Glu 


Arg 


Lys 


Trp 
260 









85 


Ser 


Gin 


Asp Asn 


Cys 
100 


Thr 


Met 


Thr Ser 


Glu 
115 


Asn 


Glu 


Ala Ser 


Asn 
130 


Pro 


Met 


Ser Pro 


Val 
145 


His 


Asn 


Phe Gly 


Lys 
160 


Ala 


Pro 


Val Val 


Arg 
175 


Gly 


Met 


Lys Leu 


Trp 
190 


Cys 


Leu 


Phe Tyr 


Tyr 
205 


Ser 


He 


Leu Leu 


Pro 

O O f\ 

220 


Lys 


He 


inr Lieu 


HQ 






235 


Thr 


Cys 


Gly Pro 


He 
250 


Ser 


Cys 


Gly 











He 


Phe 


Val Val Asn 






10b 


Glu 


Lys 


Lys Glu Arg 






ion 

1/5 U 


Tyr 


Asn 


Val Thr Ser 




13b 


b-iy 


Arg 


Thr Ser Arg 






150 


Arg 


Ser 


Asn Ser He 




165 


Arg 


Gly Trp Leu Tyr 






180 


Lys 


Lys 


Arg Trp Phe 






195 


Arg 


Asp 


Glu Lys Glu 






210 


Ser 


Phe 


Gin He Ser 






225 


Ala 


Asn 


Met Leu Leu 






240 


He 


Ser 


Ala Leu He 






255 



<210> 47 

<211> 217 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 893050 . 1 .orfl : 2000FEB18 



<400> 47 












Ser 


Leu Pro 


Ser Thr Ser 


Phe 


Arg Val Ser 


Ser 


Leu Phe Ser Gly 


1 




5 




10 




15 


His 


Leu Glu 


Val Leu Lys 


Leu 


Leu Val Ala 


Arg 


Gly Ala Asp Leu 






20 




25 




30 


Gly 


Cys Lys 


Ala Arg Lys 


Gly 


Tyr Gly Leu 


Leu 


His Thr Ala Ala 






35 




40 




45 


Ala 


Ser Gly 


Gin He Glu 


Val 


Val Lys Tyr 


Leu 


Leu Arg Met Gly 




50 




55 




60 


Ala 


Glu He 


Asp Glu Pro 


Asn 


Ala Phe Gly 


Asn 


Thr Ala Leu His 






65 




70 




75 


He 


Ala Cys 


Tyr Leu Gly 


Gin 


Asp Ala Val 


Ala 


He Glu Leu Val 




80 




85 




90 


Asri 


Ala Gly 


Ala Asn Val 


Asn 


Gin Pro Asn 


Asp 


Lys Gly Phe Thr 




95 




100 




105 


Pro 


Leu His 


Val Ala Ala 


Val 


Ser Thr Asn 


Gly 


Ala Leu Cys Leu 






110 




115 




120 


Glu 


Leu Leu 


Val Asn Asn 


Gly 


Ala Asp Val 


Asn 


Tyr Gin Ser Lys 






125 




130 




135 


Glu 


Gly Lys 


Ser Pro Leu 


His 


Met Ala Ala 


He 


His Gly Arg Phe 




140 




145 




150 


Thr 


Arg Ser 


Gin He Leu 


He 


Gin Asn Gly 


Ser 


Glu He Asp Cys 






155 




160 




165 


Ala 


Asp Lys 


Phe Gly Asn 


Thr 


Pro Leu His 


Val 


Ala Ala Arg Tyr 






170 




175 




180 


Gly 


His Glu 


Leu Leu He 


Ser 


Thr Leu Met 


Thr 


Asn Gly Ala Asp 






185 




190 




195 


Thr 


Gly Arg 


Arg Gly He 


His 


Asp Met Phe 


Pro 


Leu His Leu Ala 






200 




205 




210 


Val 


Leu Phe 


Gly Phe Ser 


Asp 












215 
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<210> 48 
<211> 716 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 980153 . 1 .orfl :2000FEB18 
<220> 

<221> unsure 
<222> 683 

<223> unknown or other 
<400> 48 



Gin 


Arg 


Gly 


Ala 


Lys 


Thr 


Arg 


Leu Arg 


Pro 


Phe 


Ser Pro 


xxx y ii x o 


1 








5 








10 






15 


Cvs 


TVr 


Lys 


Ala 


Ala 


Thr 


He 


Lys Asp Val 


Phe 


Gly Arg 


Aqti Ala 










20 








25 






30 


Leu 


His 


Pro 


Cys 


Phe 


Leu 


Leu 


Val Glu 


Lys 


Lys 


niv Val 

VsXjr V CI X 


xifejtx rio u 










35 








40 






45 




Leu 


Tie 

X Xc 




XJ jr o 




Val 

V CI J. 


Asp Leu Leu 


Val 

V CLJL 




t .v^ nln 


















55 








Cp-r 




X X^J 


X11X 


Al a 


T.oi l 


nib 


Arg Ser 


He 


irJX*= 


ijyi \jj.y 


nib lie 










65 








70 






75 


Asp 


Cys 


Val 






T.oi i 




Lys His 


Gly 


Val 


Qor T.Dii 
Oci Xicli 


XjfX lie 


















85 








Gin 


Asp 


Lys 


Gin 




ucu 


Coy- 


Ala Leu 


Asp 


ucu 


Val Mf=>f- 


Fa^g Sen 
XJy o rio 










95 








100 






X u _» 


Ai. y 




J. JXX 


His 


Va 1 

VCLX 


V da. 


IrXlti 


Lys Asn 


Thr 




XT X \J XXIX 


A cm Va 1 
riojj val 










110 






115 




120 


iyi 


.nix 


1 J-P 


V7 _L y 


A cn 
flop 


A en 


XilX 


Asn Phe 


Thr 


Ucu 


Hi v Hi «^ 
xaxy nxo 


^riy Del 










125 

JL a ^ 








130 






X J *J 


Gin 




O triJL 


J-Jjr o 


Hi C3 


Hi «5 




Glu Leu 


Val 




JjcU. iri.lt! 


otrl rax y 










140 








145 




150 

J 1 Kf 


Ser 


Glv 


He 




He 


Lys 


(51 Ti 

V3 XH 


Val Val 


Leu 


Cys 


xj y o r^iic 


Hi =J QciT" 
nxd oei 










155 








160 






165 


Val 


Phe 


Leu 


Ser 


Gin 


Lys 


Gly 


Gin Val 


Tyr 


Thr 


Cys Gly 


Hi «; pJv 










170 








175 






180 


Pro 


Glv 


Glv 


Arg 


Leu 


Gly 


His 


Gly Asp 


Glu 


Gin 


Thr Cys 


Leu Val 










185 








190 






195 


Pro 


Arg 


Leu 


Val 


Glu 


Glv 


Leu 


Asn Gly His 


Asn 


Cvs Ser 


Gin Val 










200 








205 






210 


Ala 


Ala 


Ala 


Lys 


Asp 


His 


Thr 


Val Val 


Leu 


Thr 


GlU ASD 


Glv Cvs 










215 








220 






225 


Val 


Tvr 


Thr 


Phe 


Glv 


Leu 


Asn 


He Phe 


His 


Gin 


Leu Gly 


He He 










230 








235 




240 


Pro 


Pro 


Pro 


Ser 


Ser 


Cys 


Asn 


Val Pro 


Arg 


Gin 


He Gin 


Ala Lys 










245 








250 






255 


Tyr 


Leu 


Lys 


Gly 


Arg 


Thr 


He 


He Gly Val 


Ala 


Ala Gly 


Arg Phe 










260 








265 






270 


His 


Thr 


Val 


Leu 


Trp 


Thr 


Arg 


Glu Ala 


Val 


Tyr 


Thr Met 


Gly Leu 










275 








280 






285 


Asn 


Gly 


Gly 


Gin 


Leu 


Gly 


Cys 


Leu Leu Asp 


Pro 


Asn Gly 


Glu Lys 










290 








295 






300 


Cys 


Val 


Thr 


Ala 


Pro 


Arg 


Gin 


Val Ser 


Ala 


Leu 


His His 


Lys Asp 










305 








310 






315 


He 


Ala 


Leu 


Ser 


Leu 


Val 


Ala 


Ala Ser Asp 


Gly 


Ala Thr 


Val Cys 










320 








325 






330 


Val 


Thr 


Thr 


Arg 


Gly 


Asp 


He 


Tyr Leu 


Leu 


Ala 


Asp Tyr 


Gin Cys 










335 








340 






345 


Lys 


Lys 


Met 


Ala 


Ser 


Lys 


Gin 


Leu Asn 


Leu 


Lys 


Lys Val 


Leu Val 










350 








355 






360 


Ser 


Gly 


Gly 


His 


Met 


Glu 


Tyr 


Lys Val Asp 


Pro 


Glu His 


Leu Lys 










365 






370 






375 


Glu 


Asn 


Gly 


Gly 


Gin 


Lys 


He 


Cys He 


Leu 


Ala 


Met Asp 


Gly Ala 



380 385 390 



35/69 



WO 01/62922 



PCT/US01/05896 



Gly Arg 


Val 


Phe 


Cys 




Arg 


Ser 








395 








Cvs Ara 


Trp 


Ala 


Tyr 


Pro 


Ara 


Gin 








410 








Leu Asn 


Arg 


Asn 


Glu 


lie 


Leu 


Phe 








425 








Phe Arg 


Glv 


Arg Trp 


Phe 


Glu 


Glu 








440 








Lys Glu 


lie 


Leu 


Ser 


Asn 


Leu 


His 








455 








Tyr Val 


Ser 


Asp 


lie 


Asn 


Ser 


Val 








470 








Lys Leu 


Thr 


Phe 


Ala 


His 


Arg 


Ala 








485 








Ser Gly 


Cvs 


Asn 


Phe 


Ala 


lie 


Leu 








500 








Leu Tyr 


Glu 


lie 


Pro 


Ala 


Val 


Ser 








515 








Phe Gly 


Lvs 


Leu 


Leu 


Arg 


GlU 


Ala 








53 0 






Asp Val 


Thr 


Phe 


Gin 


Val 


Glv 


Asn 








545 






Tyr lie 


Leu 


Ala 


Val 


His 


Ser 


Asp 








560 








Ser Asp 


Glv 


Asn 


Thr 


Ser 


Glu 


Phe 








575 








Glu Asp 


Ser 


Ala Gly 


Cvs 


His 


Leu 








590 








Pro Asp 


Met 


Phe 


Glu 


Tvr 


Leu 


Leu 








605 








Cys Asp 


Phe 


Leu 


Thr 


His 


Glv 


Phe 








620 








Lvs Asn 


Pro 


Glu 


Glu 


Tvr 
J.J/J. 


Gin 


Glv 








635 








Lys Val 


Asn 


Phe 


His 


GlU 


Asp 


Asp 








650 






Val Tyr 


Lys 


Ser 


Asn 


Gin 


Ala 


Gin 








665 








Ser Lys 


Pro 


Lys 


Ser 


Cys 


Lys 


Xaa 








680 








Asp Pro 


Val 


Arg 


Met 


Leu 


Gin 


Thr 








695 








Ser Asn 


Leu 


Ser 


Ser 


Arg 


Leu 


Asp 



710 

<210> 49 
<211> 107 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG:350398.1 

<220> 

<221> unsure 
<222> 22 

<223> unknown or other 
<400> 49 

Glu Pro Leu Ser Pro Pro Gly Arg 

1 5 
Cys Glu Gly Pro Gin Gly Xaa Phe 
20 

Arg Phe Leu Leu Arg Tyr Trp His 
35 

Ala He Val Pro Val Trp Ser Pro 



Val 


Asn 


Ser 


Ser Leu Lys 


Gin 




400 




405 


Val 


Phe 


He 


Ser Asp He 


Ala 




415 




420 


Val 


Thr 


Gin 


Asp Gly Glu 


Gly 




430 






435 


Lvs 


Ara 


Lvs 


Ser Ser Glu 


Lys 




445 






450 


Asn 


Ser 


Ser 


Ser Asp Val 


Ser 




460 




465 


Tvr 


Glu 


Ara 


He Ara Leu 


Glu 




475 






480 


Val 


Ser 


Val 


S er Thr Asn 


Pro 




490 




495 


Gin 


Ser 


Asp 


Pro Lvs Thr 


Ser 




505 






510 


Ser 


Ser 


Ser 


Phe Phe Glu 


Glu 




520 






525 


Asp 


Glu 


Met 


Asd Ser He 


His 




535 






540 


Arg 


Leu 


Phe 


Pro Ala His 


Lvs 




550 






555 


Phe 


Phe 


Gin 


Lys Leu Phe 


Leu 




565 






570 


Thr 


Asp 


He 


Tvr Gin Lvs 


Asp 




580 






585 


Phe 


Val 


Val 


Glu Lvs Val 


His 




595 




600 


Gin 


Phe 


He 


•Ljr<U llll AD]J 


Thr 




610 






61 5 


Lys 


Pro 


Arg 


Tie* Hi s Lpvi 


Asn 




625 






630 


Thr 


Leu 


Asn 


53eT His L^ii 


Asn 




640 






645 


Asn 


Gin 


Lys 


Ser Ala Phe 


Glu 




655 




660 


Thr 


Val 


Ser 


Glu Ara Gin 


Lys 




670 






675 


Glv 


Lys 


Asn 


lie Ara Glu 


Asp 




685 






690 


Val 


Ala 


Lys 


Lys Phe Asp 


Phe 




700 






705 


Gly 


Val 


Arg 








715 









orf3 :2000FEB18 



He Pro Gly Ala Ala Gly Glu 
10 15 

Ala Ser Arg Gin Pro Tyr Ser 
25 30 

Leu Thr Pro He Thr Pro Trp 
40 45 

Arg Gly Arg Ser Arg Gly Ser 
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50 55 60 

Pro Asn Ser Thr Ser Gin Thr Ser lie Gin Ala Gly Thr Ser Thr 

65 70 75 

Lea Leu Ala Ser Arg His Gin Asn lie Trp Glu Asp Met Cys Val 

80 85 90 

Ser Thr Cys Met Trp Gly His Thr Gly Gly Asn Met Gly Met Arg 

95 100 105 

Ala Val 



<210> 50 

<211> 645 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 475551 . 1 . orf 3 : 2000FEB18 
<220> 

<221> unsure 
<222> 141 

<223> unknown or other 



<400> 50 






















Leu Gin 


Gly 


Gin 


Ser 


Gly 


Ala 


Astd 


Met 


Asp 


Lvs 


Arg Val Lys 


Lvs 


1 






5 










10 






15 


Leu Pro 


Leu 


Met 


Ala 


Leu 


Ser 


Thr 


Thr 


Met 


Ala 


Glu Ser Phe 


Lvs 








20 










25 






30 


Glu Leu 


Asp 


Pro 


Asp 


Ser 


Ser 


Met 


Gly Lys 


Ala 


Leu Glu Met 


Ser 








35 










40 






45 


Cys Ala 


He 


Gin 


Asn 


Gin 


Leu 


Ala 


Arg 


He 


Leu 


Ala Glu Phe 


Glu 








50 








55 






60 


Met Thr 


Leu 


Glu 


Arg 


Asp 


Val 


Leu 


Gin 


Pro 


Leu 


Ser Arg Leu 


Ser 








65 










70 






75 


Glu Glu 


Glu 


Leu 


Pro 


Ala 


lie 


Leu 


Lys 


His 


Lys 


Lys Ser Leu 


Gin 








80 










85 






90 


Lys Leu 


Val 


Ser 


Asp 


Trp 


Asn 


Thr 


Leu 


Lys 


Asn 


Arg Leu Ser 


Gin 








95 










100 






105 


Ala Thr 


Lys 


Asn 


Ser 


Gly 


Ser 


Ser 


Gin Gly 


Leu 


Gly Gly Ser 


Pro 








110 










115 






120 


Gly Ser 


His 


Ser 


His 


Thr 


Thr 


Met 


Ala 


Asn 


Lys 


Val Glu Thr 


Leu 








125 










13 0 




13 5 


Phe Tyr 


Cys 


Ser 


Arg 


Xaa 


Ser 


Pro 


Arg 


Lys 


Val 


Glu Gin Cys 


Arg 








140 










145 






150 


Asp Glu 


Tyr 


Leu 


Ala 


Asp 


Leu 


Tyr 


His 


Phe 


Val 


Thr Lys Glu 


Asp 








155 










160 






165 


Ser Tyr 


Ala 


Asn 


Tyr 


Phe 


He 


Arg 


Leu 


Leu 


Glu 


He Gin Ala 


Asp 








170 










175 






180 


Tyr His 


Arg 


Arg 


Ser 


Leu 


Ser 


Ser 


Leu 


Asp 


Thr 


Ala Leu Ala 


Glu 








185 










190 






195 


Leu Arg 


Glu 


Asn 


His 


Gly 


Gin 


Ala 


Asp 


His 


Ser 


Pro Ser Met 


Thr 








200 






205 






210 


Ala Thr 


His 


Phe 


Pro 


Arg 


Val 


Tyr 


Gly Val 


Ser 


Leu Ala Thr 


His 








215 










220 






225 


Leu Gin 


Glu 


Leu 


Gly 


Arg 


Glu 


He 


Ala 


Leu 


Pro 


He Glu Ala 


Cys 








230 








235 






240 


Val Met 


Met 


Leu 


Leu 


Ser 


Glu 


Gly 


Met 


Lys 


Glu 


Glu Gly Leu 


Phe 








245 










250 






255 


Arg Leu 


Ala 


Ala 


Gly 


Ala 


Ser 


Val 


Leu Lys 


Arg 


Leu Lys Gin 


Thr 








260 










265 






270 


Met Ala 


Ser 


Asp 


Pro 


His 


Ser 


Leu 


Glu 


GlU 


Phe 


Cys Ser Asp 


Pro 








275 










280 






285 


His Ala 


Val 


Ala 


Gly 


Ala 


Leu 


Lys 


Ser 


Tyr 


Leu 


Arg Glu Leu 


Pro 








290 










295 






300 


Glu Pro 


Leu 


Met 


Thr 


Phe 


Asp 


Leu 


Tyr 


Asp 


Asp 


Trp Met Arg 


Ala 








305 










310 






315 
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Ala 


Ser 


Leu 


Lys 


Glu 


Pro 


Gly 


Ala Arg 


Leu 


Gin 


Ala 


Leu 


Gin 


Glu 










320 










325 










330 


Val 


Cys 


Ser Arg 


Leu 


Pro 


Pro 


Glu 


Asn 


Leu 


Ser 


Asn 


Leu Arg 


Tyr 










335 










340 










345 


Leu 


Met 


Lys 


Phe 


Leu 
350 


Ala 


Arg 


Leu 


Ala 


Glu 
355 


Glu 


Gin 


Glu 


Val 


Asn 
360 


Lys 


Met 


Thr 


Pro 


Ser 


Asn 


He 


Ala 


He 


Val 


Leu 


Gly Pro Asn 


Leu 










3 65 










370 










375 


Leu 


Trp 


Pro 


Pro 


Glu 


Lys 


Glu 


Gly Asp 


Gin 


Ala 


Gin Leu Asp 


Ala 










380 










385 










390 


Ala 


Ser 


Val 


Ser 


Ser 
395 


He 


Gin 


Val 


Val 


Gly 
400 


Val 


Val 


Glu 


Ala 


Leu 
405 


lie 


Gin 


Ser 


Ala 


Asp 
410 


Thr 


Leu 


Phe 


Pro 


Gly 
415 


Asp 


He 


Asn 


Phe 


Asn 
420 


Val 


Ser 


Gly Leu 


Phe 


Ser 


Ala 


Val 


Thr 


Leu 


Gin 


Asp 


Thr 


Val 


Ser 










425 










430 










435 


Asp 


Arg 


Leu 


Ala 


Ser 
440 


Glu 


Glu 


Leu 


Pro 


Ser 
445 


Thr 


Ala 


Val 


Pro 


Thr 
450 


Pro 


Ala 


Thr 


Thr 


Pro 
455 


Ala 


Pro 


Ala 


Pro 


Ala 
460 


Pro 


Ala 


Pro 


Ala 


Pro 
465 


Ala 


Pro 


Ala 


Leu 


Ala 
470 


Ser 


Ala 


Ala 


Thr 


Lys 
475 


Glu 


Arg 


Thr 


GlU 


Ser 
480 


Glu 


Val 


Pro 


Pro 


Arg 
485 


Pro 


Ala 


Ser 


Pro 


Lys 
490 


Val 


Thr 


Arg 


Ser 


Pro 
495 


Pro 


Glu 


Thr 


Ala 


Ala 


Pro 


Val 


Glu Asp 


Met 


Ala 


Arg Arg Thr 


Lys 










500 










505 










510 


Arg 


Pro 


Ala 


Pro 


Ala 
515 


Arg 


Pro 


Thr 


Met 


Pro 
520 


Pro 


Pro 


Gin 


Val 


Ser 
525 


Gly 


Ser 


Arg 


Ser 


Ser 


Pro 


Pro 


A-la 


Pro 


Pro 


Leu 


Pro 


Pro Gly 


Ser 










530 










535 










540 


Gly 


Ser 


Pro Gly 


Thr 


Pro 


Gin 


Ala 


Leu 


Pro 


Arg 


Arg Leu Val 


Gly 










545 










550 










555 


Ser 


Ser 


Leu Arg 


Ala 


Pro 


Thr 


Val 


Pro 


Pro 


Pro 


Leu 


Pro 


Pro 


Thr 










560 










565 










570 


Pro 


Pro 


Gin 


Pro 


Ala 
575 


Arg 


Arg 


Gin 


Ser 


Arg 
580 


Arg 


Ser 


Pro 


Ala 


Ser 
585 


Pro 


Ser 


Pro 


Ala 


Ser 
590 


Pro 


Gly 


Pro 


Ala 


Ser 
595 


Pro 


Ser 


Pro 


Val 


Ser 
600 


Leu 


Ser 


Asn 


Pro 


Ala 
605 


Gin 


Val 


Asp 


Leu 


Gly 
610 


Ala 


Ala 


Thr 


Ala 


Glu 
615 


Gly 


Gly 


Ala 


Pro 


Glu 


Ala 


He 


Ser Gly 


Val 


Pro 


Thr 


Pro 


Pro 


Ala 










620 










625 










63 0 


He 


Pro 


Pro 


Gin 


Pro 
635 


Arg 


Pro 


Arg 


Ser 


Leu 
640 


Ala 


Ser 


Glu 


Thr 


Asn 
645 



<210> 51 

<211> 177 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 481407 . 2 . orf 3 : 2000FEB18 
<400> 51 



Cys Gin Gly 


Arg 


Cys 


Glu 


Arg 


Leu Arg Arg 


Val 


Gly Val Glu 


Pro 


1 




5 






10 






15 


Gin Leu Ser 


Arg 


Gly 


Leu 


Ala 


Leu Phe Trp 


Ser 


Pro Arg Pro 


Asn 






20 






25 






30 


Pro Pro Glu 


Glu 


Met 


Ser 


Gly 


Gly Leu Ala 


Pro 


Ser Lys Ser 


Thr 






35 






40 






45 


Val Tyr Val 


Ser 


Asn 


Leu 


Pro 


Phe Ser Leu 


Thr 


Asn Asn Asp 


Leu 






50 






55 






60 


Tyr Arg He 


Phe 


Ser 


Lys 


Tyr 


Gly Lys Val 


Val 


Lys Val Thr 


He 






65 






70 






75 


Met Lys Asp 


Lys 


Asp 


Thr 


Arg 


Lys Ser Lys 


Gly 


Val Ala Phe 


He 
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80 






85 




90 


Leu Plie Leu 


Asp Lys 


Asd Ser 


Ala Gin 

ii^ 1 -* Will 


Asn 


Cys 


Thr Ara Ala lie 




95 






100 


105 


Asn Asn Lys 


Gin Leu 


Phe Gly 


Arg Val 


lie 


Lvs 


Ala Ser lie Ala 




110 


115 


120 


lie Asp Asn 


Gly Arg 


Ala Ala 


Glu Phe 


lie 


Arg 


Arc Aro Asri Tvr 

X A, J- i Wf Al^4l ^ Jz 




125 






130 




135 


Phe Asp Lys 


Ser Lys 


Cvs Tvr 


Glu Cys 


Glv 


Glu 


Ser Gly His Leu 




140 






145 




150 


Ser Tyr Ala 


Cys Pro 


Lys Asn 


Met Leu 


Gly 


Glu 


Arg Glu Pro Pro 




155 






160 




165 


Lys Lys Lys 


Glu Lys 


Lys Glu 


Lys Lys 


Glu 


Ser 


Ser 




170 






175 







<210> 52 

<211> 217 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 443 580 . 1 . orf 1 : 2000FEB01 
<400> 52 



Glu 


Thr 


Ser 


Leu 


Arg 


Ser 


Gly 


Gin 


He 


Pro 


Thr 


Leu Asp Ser 


Ser 


i 
± 








c 














15 


Glu 


His 


Asn 


Leu 


Ser 
20 


Pro 


Glu 


Pro 


Leu 


Glu 
25 


Leu 


Asp Arg Met 


Pro 
30 


His 


Ser 


Pro 


Leu 


He 
35 


Ser 


He 


Pro 


His 


Val 
40 


Trp 


Cys His Pro 


Glu 
45 


Glu 


Glu 


Glu Arg 


Met 


His 


Asp 


Glu 


Leu 


Leu 


Gin 


Ala Val. Ser 


Lys 










50 










55 






60 


Gly 


Pro 


Val 


Met 


Phe 
65 


Arg 


Asp 


Val 


Ser 


He 
70 


Asp 


Phe Ser Gin 


Glu 
75 


Glu 


Trp 


Glu 


Cys 


Leu 


Asp 


Ala 


Asp Gin 


Met 


Asn 


Leu Tyr Lys 


Glu 










80 










85 






90 


Val 


Met 


Leu 


Glu 


Asn 
95 


Phe 


Ser 


Asn 


Leu 


Val 
100 


Ser 


Val Gly Leu 


Ser 
105 


Asn 


Ser 


Lys 


Pro 


Ala 


Val 


He 


Ser 


Leu 


Leu 


Glu 


Gin Gly Lys 


Glu 










110 










115 




12 0 


Pro 


Trp 


Met 


Val 


Asp 


Arg 


Glu 


Leu 


Thr 


Arg 


Gly 


Leu Cys Ser Asp 










125 










130 






135 


Leu 


Glu 


Ser 


Met 


Cys 
140 


Glu 


Thr 


Lys 


He 


Leu 
145 


Ser 


Leu Lys Lys 


Arg 
150 


His 


Phe 


Ser 


Gin 


Val 


He 


He 


Thr Arg 


Glu 


Asp 


Met Ser Thr 


Phe 










155 










160 




165 


lie 


Gin 


Pro 


Thr 


Phe 
17 0 


Leu 


He 


Pro 


Pro 


Gin 
175 


Lys 


Thr Met Ser 


Glu 
180 


Glu 


Lys 


Pro 


Trp 


Glu 
185 


Cys 


Lys 


He 


Cys 


Gly 
190 


Lys 


Thr Phe Asn 


Gin 
195 


Asn 


Ser 


Gin 


Phe 


He 


Gin 


His 


Gin Arg 


He 


His 


Phe Gly Glu 


Lys 










200 










205 






210 


His 


Tyr 


Glu 


Ser 


Lys 
215 


Glu 


Lys 















<210> 53 
<211> 151 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 803015 . 1 . orf 3 :2000FEB01 
<400> 53 

Ala Gly Cys Gly Trp Asp Pro Val Phe Pro Ala Pro Arg Gly Thr 
1 5 10 15 
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Trp 


Phe 


Leu Cys 


Pro 


Glv 


Phe 


Cys 


His 


Ser 


Val 


Thr Tyr Ala 


Met 








20 










25 






30 


Pro 


Cvs 


Cys Ser 


His 


Arg 


Arg 


Cys 


Arg 


Glu 


Asp 


Pro Glv Thr 


Ser 








35 










40 






45 


Glu 


Ser 


Gin Glu 


Met 


Asp 


Pro 


Val 


Ala 


Phe 


Asp 


Asp Val Ala 


Val 








50 










55 






60 


Asn 


Piie 


Thr Gin 


Glu 


Glu 


Tro 


Ala 


Leu 


Leu 


Asp 


He Ser Gin 


Arg 








65 








70 




75 


Lys 


Leu 


Tyr Lys 


GlU 


Val 


Met 


Leu 


Glu 


Thr 


Phe 


Arg Asn Leu 


Thr 








80 










85 






90 


Ser 


Val 


Gly Lvs 


Ser 


Tro 


Lvs 


Asp 


Gin 


Asn 


lie 


Glu Tyr Glu 


Tvr 








95 










100 






105 


Gin 


Asn 


Pro Arg 


Arg 


Asn 


Phe 


Arg 


Ser 


Leu 


He 


Glu Lys Lys 


Val 








110 










115 






120 


Asn 


Glu 


lie Lys 


Asp 


Asp 


Ser 


His 


Cys 


Gly 


Glu 


Thr Phe Thr 


Gin 








125 










130 






135 


Val 


Pro 


Asp Asp 


Arg 


Leu 


Asn 


Phe 


Gin 


Glu 


Lys 


Lys Ala Ser 


Pro 



140 145 150 

Glu 



<210> 54 

<211> 193 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG : 027410 . 3 . orf 3 : 2000MAY19 
<400> 54 



His 


Thr 


Glu 


Ala Arg 


Pro 


Pro 


Arg 


Arg 


Glu 


Ser 


Trp He Ser 


Asp 


1 






5 










10 






15 


He Arg 


Ala 


Gly Thr 


Ala 


Pro 


Ser 


Cys 


Arg 


Asn 


His lie Lys 


Ser 








20 










25 






30 


Ser 


Cys 


Ser 


Leu He 


Ala 


Phe 


Asn 


Ser 


Asp 


Arg 


Pro Gly Val 


Leu 








35 










40 






45 


Gly He 


Val 


Pro Leu 


Gin 


Gly 


Gin 


Gly 


Glu 


Asp 


Lys Arg Arg 


Val 








50 










55 






60 


Ala 


His 


Leu 


Gly Cys 


His 


Ser 


Asp 


Leu 


Val 


Thr 


Asp Leu Asp 


Phe 








65 










70 






75 


Ser 


Pro 


Phe 


Asp Asp 


Phe 


Leu 


Leu 


Ala 


Thr 


Gly 


Ser Ala Asp 


Arg 








80 










85 






90 


Thr 


Val 


Lys 


Leu Trp 


Arg 


Leu 


Pro 


Gly 


Pro 


Gly 


Gin Ala Leu 


Pro 








95 










100 






105 


Ser 


Ala 


Pro 


Gly Val 


Val 


Leu 


Gly 


Pro 


Glu 


Asp 


Leu Pro Val 


Glu 








110 










115 




120 


Val 


Leu 


Gin 


Phe His 


Pro 


Thr 


Ser 


Asp 


Gly 


He 


Leu Ser Trp 


Gin 








125 










130 






135 


Pro 


Met 


Gly 


Thr Trp 


Cys 


Arg 


Ala 


Pro 


Ser 


Gly 


Ala Glu Met 


Glu 








140 










145 




150 


Pro 


Trp 


Trp 


Ala Arg 


Arg 


Ala 


Arg 


Thr 


Ser 


Ser 


Cys Gly Ser 


Leu 








155 










160 






165 


Thr 


Pro 


Glu 


Gin Ser 


Arg 


Gly 


Pro 


Leu 


Arg 


Ala 


Arg Arg Pro 


Met 








. 170 










175 






180 


Arg Thr 


Ala 


Gly He 


Ala 


Gly 


Trp 


His 


Gly 


Trp 


Ala Pro 





185 190 

<210> 55 

<211> 282 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 171377 . 1 .orf 3 : 2000MAY19 
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<400> 55 
Arg Pro Gin Pro 
1 

Ser Gin Pro Ala 


Ser 


Arg 


Ser Gly 


Pro 


Arg 


Ser Pro 


Ala 


Pro 


Thr Pro 


Thr 


Glu 


Arg Arg 


Gin 


Val 


Asp Arg 


Glu 


Pro 


Leu Gly 


Phe 


Ala 


Phe Gly 


Met 


Val 


Arg Cys 


Val 


Ala 


Phe Gly 


Met 


Pro 


Leu Cys 


Met 


Gly 


Asp Phe 


lie 


Phe 


S er Phe 


Arg 


Phe 


His Asn 


Asp 


Phe 


Cys Val 


Ala 


Ala 


Ala Trp 


Arg 


Pro 


Ser Ser 


Glu 


Ala 


Val Cys 



<210> 56 
<211> 211 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 



Arg 


Ala 


Arg Thr 


Gin 


Thr 


Gly Leu 


Ala 


Arg 


Leu Leu 


Val 


Ser 


Ala Ala 


Pro 


Ala 


Val Ala 


Ala 


Gly 


Arg Arg 


Leu 


Val 


Gly Leu 


lie 


Lys 


Val Leu 


Cys 


Gly 


Ser Tyr 


Asn 


Glu 


Ala Lys 


Pro 


Cys 


Arg Leu 


Glu 


Glu 


Ser Ser 


Ala 


Pro 


Ala Glu 


Tyr 


Thr 


Met Ala 


Tyr 


Thr 


Glu Asn 


Val 


Ser 


Phe Thr 


Lys 


Gly 


Leu Thr 


Thr 


Ala 


Ala Met 


Ala 


Gly 


Ala Thr 



Ala 


Ala 


Pro 


Pro Arg 


10 






Arg 


Pro 


Thr 


Asp Gly 


25 






Cvs 


Pro 


Gly 


Pro Ala 


40 








Ser 


Arcr 


Pro 


Glu Ser 


55 








Ala 


Pro 


Ser 


Met Ser 


70 








Aso 


Arcr 


Ser 


Pro Arg 


85 








Arg 


Tro 


Arg 


Arg Leu 


100 








Gin 


Trp 


Leu 


Phe Ala 


115 






Ser 


Glv 


Glu 


Thr Gly 


130 








Asp 


Val 


Ser 


Ser lie 


145 








His 


Arcr 


lie 


Gin Tyz 


160 








Ser 


Lvs 


Thr 


Met His 


175 






Phe 


Phe 


Val 


Thr Leu 


190 








Ala 


Leu 


Val 


He Tyr 


205 








Lys 


Arg 


Phe 


Pro Leu 

A* J, W ,U *-4» 


220 








Phe 


Phe 


Trn 


Leu Val 


235 








Asp 


Val 


Lys 


Gly Ala 


250 








Ser 


Val 


Cys 


His Gly 


265 








Pro 


Ser 


Met 





280 



PCT7US01/05896 



Pro 

15 
Arg 

30 
Ala 

45 
Gin 

60 
Ser 

75 
Gin 

90 
Glu 
105 
He 
120 
Ala 
135 
He 
150 
Glu 
165 
Leu 
180 
Gly 
195 
Leu 
210 
Val 
225 
Ala 
240 
Thr 
255 
Glu 
270 



Leu 
5 

Ser 

20 
Pro 

35 
Ala 

50 
Arg 

65 
Pro 

80 
Leu 

95 
Phe 
110 
Ser 
125 
Asn 
140 
Tyr 
155 
Asp 
17 0 
Ser 
185 
Phe 
200 
Leu 
215 
Thr 
23 0 
Gly 
245 
Leu 
260 
Ser 
275 



LG:352559 . 1 . orf 2 : 2000MAY19 



<400> 56 

Val Val Ser Ser Thr Thr Ala Ser Ala Leu Gin Ser Gin Ser Lys 
15 10 15 

Ala Leu Leu Gin Met Lys Ser Gin Glu Glu Val Glu Val Ala Gly 

20 25 30 

He Lys Leu Cys Lys Ala Met Ser Leu Gly Ser Leu Thr Phe Thr 

35 40 45 

Asp Val Ala He Asp Phe Ser Gin Asp Glu Trp Glu Trp Leu Asn 

50 55 60 

Leu Ala Gin Arg Ser Leu Tyr Lys Lys Val Met Leu Glu Asn Tyr 

65 ~ 70 75 

Arg Asn Leu Val Ser Val Gly Leu Cys He Ser Lys Pro Asp Val 

80 85 90 

He Ser Leu Leu Glu Gin Glu Lys Asp Pro Trp Val He Lys Gly 

95 100 105 

Gly Met Asn Arg Gly Leu Cys Pro Asp Leu Glu Cys Val Trp Val 
110 115 120 

Thr Lys Ser Leu Ser Leu Asn Gin Asp He Tyr Glu Glu Lys Leu 
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125 








130 




135 


Pro Pro Ala lie 


He 


Met 


Glu Arg 


Leu Lys 


Ser Tyr Asp Leu 


Glu 




140 








145 




150 


Cys Ser Thr Leu 


Gly 


Lys 


Asn Trp 


Lys Cys 


Glu Asp Leu Phe 


Glu 




155 








160 




165 


Arg Glu Leu Val 


Asn 


Gin 


Lys Thr 


His Phe 


Arg Gin Glu Thr 


He 




170 








175 


180 


Thr His lie Asp 


Thr 


Leu 


He 


GlU 


Lys Arg 


Asp His Ser Asn 


Lys 




185 








190 




195 


Ser Gly Thr Val 


Phe 


His 


Leu 


Asn 


Thr Leu 


Ser Tyr He Lys 


Gin 




200 








205 




210 


He 
















<210> 57 
















<211> 366 
















<212> PRT 
















<213> Homo sapiens 














<220> 
















<221> misc_feature 














<223> Incyte ID 


No: 


LG: 2473 84.1. 


orf2:2000MAY19 




<400> 57 
















Arg Arg Gin Leu 


Gly 


vai 


Ala 

Aid 


Leu 


He Pro 


ber mis Axy ixieu 


Asp 


1 


5 








ill 




15 


Tyr Lys Ser Ser 


Leu 


i ±e 


tain 


Asp 


uiy Asn 


Fro JMet. ValU Asn 


Leu 




20 








z b 




30 


Glu Lys Gin Leu 


He 


Cys 


Pro 


± xe 


^ys ljeu 


ijj.u ixiec rxie jLiir 


Lys 




35 












45 


Pro Val Val He 


Leu 


Pro 


Cys 


pi Y-v 

ij-Ln 


rilS Asn 


Leu Cys Arg Lys 


Cys 




50 












60 


Ala Asn Asp He 


Phe 


r>~\ ri 

va-Ln 


Ala 


ser 


/ISII irLU 


THrv" T.^ll Brn TH-IY* 
Ijl LifciU. srS-U ±1X1. 


Arg 




65 








70 




75 


Gly Gly Thr Thr' 


Met 








n~\ \r Arrr 
«xy r\i. y 


TD"h o JVrrr f'v/'ci Pro 
it rii y ^ jr * ^ 


Ser 




80 








ft R 

O ZJ 




90 


Cys Arg His Glu 


Val 


veil 


T.on 


Asp 


ri-L y n J. o 


Gly Val Tyr Gly 


Leu 




95 








ion 

-L \J \J 




105 


Gin Arg Asn Leu 


Leu 


Val 


Glu 


Asn 


He He 


Asrj He Tvr Lvs 


Gin 




110 








115 


120 


Glu Cys Ser Ser 


Arg 


Pro 


Leu 


Gin 


Lys Gly 


Ser His Pro Met 


Cys 




125 








13 0 




135 


Lys Glu His Glu 


Asp 


Glu 


Lys 


He 


Asn He 


Tyr Cys Leu Thr 


Cys 




140 








145 




150 


Glu Val Pro Thr 


Cys 


Ser 


Met 


Cys 


Lys Val 


Phe Gly He His 


Lys 




155 








160 




165 


Ala Cys Glu Val 


Ala 


Pro 


Leu 


Gin 


Ser Val 


Phe Gin Gly Gin 


Lys 




170 








175 




180 


Thr Glu Leu Asn 


Asn 


Cys 


He 


Ser 


Met Leu 


Val Ala Gly Asn 


Asp 




185 






190 




195 


Arg Val Gin Thr 


He 


He 


Thr 


Gin 


Leu Glu 


Asp- Ser Arg Arg 


Val 




200 








205 




210 


Thr Lys Glu Asn 


Ser 


His 


Gin 


Val 


Lys Glu 


Glu Leu Ser Gin 


Lys 




215 








220 




225 


Phe Asp Thr Leu 


Tyr 


Ala 


He 


Leu 


Asp Glu 


Lys Lys Ser Glu 


Leu 




23 0 








235 




240 


Leu Gin Arg He 


Thr 


Gin 


Glu 


Gin 


Glu Lys 


Lys Leu Ser Phe 


He 




245 








250 


255 


Glu Ala Leu He 


Gin 


Gin 


Tyr 


Gin 


Glu Gin 


Leu Asp Lys Ser 


Thr 




260 






265 




270 


Lys Leu Val Glu 


Thr 


Ala 


He 


Gin 


Ser Leu 


Asp Glu Pro Gly 


Gly 




275 








280 




285 


Ala Thr Phe Leu 


Leu 


Thr 


Ala 


Lys 


Gin Leu 


He Lys Ser He 


Val 




290 






295 


300 


Glu Ala Ser Lys 


Gly 


Cys 


Gin 


Leu 


Gly Lys 


Thr Glu Gin Gly 


Phe 




305 








310 




315 


Glu Asn Met Asp 


Phe 


Phe 


Thr 


Leu 


Asp Leu 


Glu His He Ala 


Asp 
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320 325 330 

Ala Leu Arg Ala lie Asp Phe Gly Thr Asp Glu Glu Glu Glu Glu 

335 340 345 

Phe lie Glu Glu Glu Asp Gin Glu Glu Glu Glu Ser Thr Glu Gly 

350 355 360 

Lys Glu Glu Gly His Gin 

365 

<210> 58 
<211> 326 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: LG : 403872 . 1 .orf 3 :2000MAY19 
<220> 

<221> unsure 
<222> 294 

<223> unknown or other 
<400> 58 

Glu Met Ala Val Gly Asn Asn Thr Gin Arg Ser Tyr Ser lie He 
15 10 15 

Pro Cys Phe He Phe Val Glu Leu Val He Met Ala Gly Thr Val 
20 25 30 

Leu Leu Ala Tyr Tyr Phe Glu Cys Thr Asp Thr Phe Gin Val His 
35 40 45 

He Gin Gly Phe Phe Cys Gin Asp Gly Asp Leu Met Lys Pro Tyr 
50 55 60 

Pro Gly Thr Glu Glu Glu Ser Phe He Thr Pro Leu Val Leu Tyr 
65 70 75 

Cys Val Leu Ala Ala Thr Pro Thr Ala He He Phe He Gly Glu 
80 85 90 

He Ser Met Tyr Phe He Lys Ser Thr Arg Glu Ser Leu He Ala 
95 100 105 

Gin Glu Lys Thr He Leu Thr Gly Glu Cys Cys Tyr Leu Asn Pro 

110 115 120 

Leu Leu Arg Arg He He Arg Phe Thr Gly Val Phe Ala Phe Gly 

125 130 135 

Leu Phe Ala Thr Asp He Phe Val Asn Ala Gly Gin Val Val Thr 

140 145 150 

Gly His Leu Thr Pro Tyr Phe Leu Thr Val Cys Lys Pro Asn Tyr 

155 160 ~ 165 

Thr Ser Ala Asp Cys Gin Ala His His Gin Phe He Asn Asn Gly 

170 175 180 

Asn' He Cys Thr Gly Asp Leu Glu Val He Glu Lys Ala Arg Arg 

185 190 195 

Ser Phe Pro Set Lys His Ala Ala Leu Ser He Tyr Ser Ala Leu 

200 205 210 

Tyr Ala Thr Met Tyr He Thr Ser Thr He Lys Thr Lys Ser Ser 

215 220 225 

Arg Leu Ala Lys Pro Val Leu Cys Leu Gly Thr Leu Cys Thr Ala 

230 235 240 

Phe Leu Thr Gly Leu Asn Arg Val Ser Glu Tyr Arg Asn His Cys 

245 250 255 

Ser Asp Val He Ala Gly Phe He Leu Gly Thr Ala Val Ala Leu 

260 265 270 

Phe Leu Gly Met Cys Val Val His Asn Phe Lys Gly Thr Gin Gly 

275 280 285 

Ser Pro Ser Lys Pro Lys Pro Glu Xaa Pro Arg Gly Val Pro Leu 

290 295 300 

Met Ala Phe Pro Arg He Glu Ser Pro Leu Glu Thr Leu Ser Ala 

305 310 315 

Gin Asn His Ser Ala Ser Met Thr Glu Val Thr 

320 325 
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<210> 59 

<211> 156 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1135213 . 1 . orfl :2000MAY19 
<400> 59 



Leu Cys 


Gly Asp 


Tyr 


Ser 


Cys 

-s* jr w 


Leu 


Thr 


Thr 


Glu 


Phe Pro Thr Glu 


1 




5 










10 




15 


lie Met 


Glu Glu 


Lys 


Gin 


Gin 


He 


He 


Leu 


Ala 


Asn Gin Asp Gly 






20 










25 




30 


Gly Thr 


Val Ala 


Gly 


Ala 


Ala 


Pro 


Thr 


Phe 


Phe 


Val He Leu Lys 






35 










40 




45 


Gin Pro 


Gly Asn 


Gly 


Lys 


Thr 


Asp Gin 


Gly 


He 


Leu Val Thr Asn 






50 










55 




60 


Gin Asp 


Ala Cys 


Ala 


Leu 


Ala 


Ser 


Ser 


Val 


Ser 


Ser Pro Val Lys 






65 










70 




75 


Ser Lys 


Gly Lys 


He 


Cys 


Leu 


Pro 


Ala 


Asp 


cys 


Thr Val Gly Gly 






80 










85 




90 


lie Thr 


Val Thr 


Leu 


Asp 


Asn 


Asn 


Ser 


Met 


Trp 


Asn Glu Phe Tyr 






95 








100 


105 


His Arg, 


Ser Thr 


Glu 


Met 


He 


Leu 


Thr 


Lys 


Gin 


Gly Arg Arg Met 






110 










115 




120 


Phe Pro 


Tyr Cys 


Arg 


Tyr 


Trp 


He 


Thr 


Gly 


Leu 


Asp Ser Asn Leu 






125 










130 




135 


Lys Tyr 


lie Leu 


Val 


Met 


Asp 


He 


Ser 


Pro 


Val 


Asp Asn His Arg 






140 










145 




150 


Tyr Lys 


Trp Asn 


Gly 


Arg 















155 

<210> 60 

<211> 262 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 474284 . 2 . orf 2 : 2000MAY19 
<400> 60 



Ser 


Ser 


Pro 


Thr 


Ser 


Trp 


Arg 


Ser Ser Met 


Pro 


Cys Thr Trp Arg 


1 








5 






10 




15 


Ser 


Arg 


Arg 


Arg 


Arg 


Cys 


Thr 


Ala Cys Ser 


Ala 


Ala Ala Ala Pro 










20 






25 




30 


Pro 


Leu 


Pro 


Ala 


Gin 


Lys 


Val 


Cys Leu Arg 


Cys 


Glu Ala Pro Cys 










35 






40 




45 


Cys 


Gin 


Ser 


His 


Val 


Gin 


Thr 


His Leu Gin 


Gin 


Pro Ser Thr Ala 










50 






55 




60 


Arg 


Gly 


His 


Leu 


Leu 


Val 


Glu 


Ala Asp Asp 


Val 


Arg Ala Trp Ser 










65 






70 




75 


Cys 


Pro 


Gin 


His 


Asn 


Ala 


Tyr 


Arg Leu Tyr 


His 


Cys Glu Ala Glu 










80 






85 




90 


Gin 


Val 


Ala 


Val 


Cys 


Gin 


Tyr 


Cys Cys Tyr 


Tyr 


Ser Gly Ala His 










95 




100 




105 


Gin 


Gly 


His 


Ser 


Val 


Cys 


Asp 


Val Glu He 


Arg 


Arg Asn Glu He 










110 






115 




120 


Arg 


Lys 


Met 


Leu 


Met 


Lys 


Gin 


Gin Asp Arg 


Leu 


Glu Glu Arg Glu 










125 






130 




135 


Gin 


Asp 


He 


Glu 


Asp 


Gin 


Leu 


Tyr Lys Leu 


Glu 


Ser Asp Lys Arg 










140 






145 




150 


Leu 


Val 


Glu 


Glu 


Lys 


Val 


Asn 


Gin Leu Lys 


Glu 


Glu Val Arg Leu 










155 






160 




165 


Gin 


Tyr 


Glu 


Lys 


Leu 


His 


Gin 


Leu Leu Asp 


Glu 


Asp Leu Arg Gin 










170 






175 




180 
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Thr 


Val 


Glu 


Val Leu 
185 


Asp 


Lys 


Ala 


Gin Ala Lys Phe Cys Ser Glu 
190 195 


Asn 


Ala 


Ala 


Gin Ala 
200 


Leu 


His 


Leu 


Gly Glu Arg Met Gin Glu Ala 
205 210 


Lys 


Lys 


Leu 


Leu Gly 
215 


Ser 


Leu 


Gin 


Leu Leu Phe Asp Lys Thr Glu 
220 225 


Asp 


Val 


Ser 


Phe Met 
230 


Lys 


Asn 


Thr 


Lys Ser Val Lys He Leu Met 
235 * 240 


Asp 


Ser 


Arg 


Cys Pro 
245 


Val 


His 


Trp 


Pro Gin Asp Pro Asp Leu His 
250 255 


Glu 


Gin 


Gin 


Pro Phe 
260 


Pro 


His 







<210> 61 

<211> 132 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 342147 . 1 . orf 3 : 2000MAY19 
<400> 61 



Lys 


Thr 


Asn 


Leu 


Tyr 


Cys Ser 


Pro Tyr 


Phe 


He 


Asp Cys Asn Arg 


1 








5 






10 






15 


Ser 


He 


Glu 


Val 


Thr 
20 


Phe He 


Leu Ser 


Trp 
25 


He 


Val 


Cys Ser- Tyr 
30 


Ala 


Val 


Cys 


Lys 


Glu 
35 


Arg Asn 


Gly Met 


Gly 
40 


Gly 


Cys 


Glu Lys Glu 
45 


Glu 


Leu 


Val 


Val 


Asp 


Phe Gly 


Gly Ala Gly 


Trp 


Arg 


Ser Leu Cys 










50 






55 






60 


Leu 


Cys 


Ser Arg 


Leu 


Gly Cys 


Ala Ala 


Pro 


Arg 


Pro 


Arg Cys Pro 










65 






70 






75 


Asp 


Phe 


Arg Arg 


Pro 


Asp Ala 


Ser Leu 


Thr 


Ser 


Ala 


Ser Ala Arg 










80 






85 






90 


^Gly 


Cys 


Trp 


Arg 


Pro 
95 


Ser Trp 


Leu Arg 


Ser 
100 


Ala 


Pro 


Pro Arg Ser 
105 


Pro 


Pro 


Thr 


Thr 


Cys 
110 


Ala His 


Pro Ala 


Trp 
115 


Arg 


Cys 


Pro Ser Pro 
120 


Arg 


Cys 


Arg 


Arg 


Thr 
125 


Pro Ala 


Pro Phe 


Arg 
13 0' 


Cys 


Cys 





<210> 62 
<211> 167 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 10973 00.1. orf 2 :2000MAY19 
<400> 62 



Pro 


Pro 


Arg 


Arg 


Arg 


Pro 


Cys 


Trp 


Phe 


Leu 


Cys 


Gly Leu Leu Ser 


1 








5 










10 




15 


Arg 


Met 


Val 


Lys 


Leu 


Phe 


He 


Gly 


Asn 


Leu 


Pro 


Arg Glu Ala Thr 










20 








25 




30 


Glu 


Gin 


Glu 


He 


Arg 


Ser 


Leu 


Phe 


Glu 


Gin 


Tyr 


Gly Lys Val Leu 










35 










40 




45 


Glu 


c ys 


Asp 


He 


He 


Lys 


Asn 


Tyr 


Gly 


Phe 


Val 


His He Glu Asp 










50 










55 




60 


Lys 


Thr 


Ala 


Ala 


Glu 


Asp 


Ala 


He 


Arg 


Asn 


Leu 


His His His Lys 










65 










70 




75 


Pro 


His 


Gly 


Val 


Asn 


He- 


Asn 


Ala 


Glu 


Ala 


Ser 


Lys Asn Lys Ser 










80 










85 




90 


Lys 


Ala 


Pro 


Thr 


Lys 


Leu 


His 


Val 


Gly 


Asn 


He 


Ser Pro Thr Cys 










95 










100 




105 


Thr 


Asn 


Gin 


Glu 


Leu 


Arg 


Ala 


Lys 


Phe 


Glu 


Glu 


His Gly Pro Ala 
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110 



115 



120 



lie Glu Cys Asp lie Ala Lys Asp Tyr Ala Phe Ala His Met Glu 
125 130 135 

Arg Ala Glu Asp Ala Ala Glu Ala lie Arg Gly Leu Asp Asn Thr 
140 145 150 

Glu Phe Gin Gly Glu Leu Leu Trp Ala Trp Val Val Ala Pro Ser 
155 160 165 

Gly Val 



<210> 63 

<211> 570 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 444850 . 9 . orf 1 : 2000MAY19 
<220> 

<221> -unsure 

<222> 569-570 

<223> unknown or other 

<400> 63 

Lys His Arg Gin Glu Asn Asn Ala Leu Asp Met Ala Pro Glu lie 
15 10 15 

His Met Thr Gly Pro Met Cys Leu He Glu Asn Thr Asn Gly Glu 
20 ^ 25 30 

Leu Val Ala Asn Pro Glu Ala Leu Lys He Leu Ser Ala He Thr 
35 40 45 

Gin Pro Val Val Val Val Ala He Val Gly Leu Tyr Arg Thr Gly 
50 " 55 60 

Lys Ser Tyr Leu Met Asn Lys Leu Ala Gly Lys Asn Lys Gly Phe 
65 70 75 

Ser Leu Gly Ser Thr Val Lys Ser His Thr Lys Gly He Trp Met 
80 85 90 

Trp Cys Val Pro His Pro Lys Lys Pro Glu His Thr Leu Val Leu 
95 100 105 

Leu Asp Thr Glu Gly Leu Gly Asp Val Lys Lys Gly Asp Asn Gin 

110 " 115 120 

Asn Asp Ser Trp lie Phe Thr Leu Ala Val Leu Leu Ser Ser Thr 

125 130 135 

Leu Val Tyr Asn Ser Met Gly Thr He Asn Gin Gin Ala Met Asp 

140 145 150 

Gin Leu Tyr Tyr Val Thr Glu Leu Thr His Arg He Arg Ser Lys 

155 160 165 

Ser Ser Pro Asp Glu Asn Glu Asn Glu Asp Ser Ala Asp Phe Val 

170 175 180 

Ser Phe Phe Pro Asp Phe Val Trp Thr Leu Arg Asp Phe Ser Leu 

185 190 195 

Asp Leu Glu Ala Asp Gly Gin Pro Leu Thr Pro Asp Glu Tyr Leu 

200 ~ 205 210 

Glu Tyr Ser Leu Lys Leu Thr Gin Gly Thr Ser Gin Lys Asp Lys 

215 220 225 

Asn Phe Asn Leu Pro Gin Leu Cys He Trp Lys Phe Phe Pro Lys 

230 235 240 

Lys Lys Cys Phe Val Phe Asp Leu Pro He His Arg Arg Lys Leu 

245 250 255 

Ala Gin Leu Glu Lys Leu Gin Asp Glu Glu Leu Asp Pro Glu Phe 

260 265 270 

Val Gin Gin Val Ala Asp Phe Cys Ser Tyr He Phe Ser Asn Ser 

275 ~ 280 285 

Lys Thr Lys Thr Leu Ser Gly Gly He Lys Val Asn Gly Pro Arg 

290 295 300 

Leu Glu Ser Leu Val Leu Thr Tyr He Asn Ala He Ser Arg Gly 



305 



310 



315 
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Asp 


Leu 


Pro 


Cys 


Met 


Glu 


Asn 


Ala Val 


Leu 


Ala 


Leu Ala Gin He 






320 








325 




330 


Glu 


Asn 


Ser 


Ala 


Ala 


Val 


Gin 


Lys Ala 


He 


Ala 


His Tyr Asp Gin 










335 






340 




345 


Gin 


Met 


Gly Gin 


Lys 


Val 


Gin 


Leu Pro 


Ala 


Glu 


Thr Leu Gin Glu 










350 








355 




360 


Leu 


Leu 


Asp Leu 


His 


Arg 


Val 


Ser Glu 


Arg 


Glu 


Ala Thr Glu Val 










365 








370 




375 


Tyr 


Met 


Lys Asn 


Ser 


Phe 


Lys 


Asp Val 


Asp 


His 


Leu Phe Gin Lys 










380 








385 




390 


Lys 


Leu 


Ala 


Ala 


Gin 


Leu 


Asp 


Lys Lys 


Arg 


Asp 


Asp Phe Cys Lys 










395 








400 




405 


Gin 


Asn 


Gin 


Glu 


Ala 


Ser 


Ser 


Asp Arg 


Cys 


Ser 


Ala Leu Leu Gin 










410 






415 




420 


Val 


lie 


Phe 


Ser 


Pro 


Leu 


Glu 


GlU Glu 


Val 


Lys 


Ala Gly He Tyr 










425 








430 




435 


Ser 


Lys 


Pro Gly 


Gly 


Tyr 


Cys 


Leu Phe 


He 


Gin 


Lys Leu Gin Asp 










440 








445 




450 


Leu 


Glu 


Lys 


Lys 


Tyr 


Tyr 


Glu 


Glu Pro 


Arg 


Lys 


Gly He Gin Ala 










455 








460 




465 


Glu 


Glu 


He 


Leu 


Gin 


Thr 


Tyr 


Leu Lys 


Ser 


Lys 


Glu Ser Val Thr 










470 








475 




480 


Asp 


Ala 


He 


Leu 


Gin 


Thr 


Asp 


Gin He 


Leu 


Thr 


Glu Lys Glu Lys 










485 








490 




495 


Glu 


lie 


Glu 


Val 


Glu 


Cys 


Val 


Lys Ala 


Glu 


Ser 


Ala Gin Ala Ser 










500 




505 




510 


Ala 


Lys 


Met 


Val 


Glu 


Glu 


Met 


Gin He 


Lys 


Tyr 


Gin Gin Met Met 










515 








520 




525 


Glu 


Glu 


Lys 


Glu 


Lys 


Ser 


Tyr 


Gin Glu 


His 


Val 


Lys Gin Leu Thr 










53 0 








535 




540 


Glu 


Lys 


Met 


Glu 


Arg 


Glu 


Arg 


Ala Gin 


Leu 


Leu 


Glu Glu Gin Glu 










545 








550 




555 


Lys 


Thr 


Leu 


Thr 


Ser 


Lys 


Leu 


Gin Val 


Ser 


Lys 


Cys Lys Xaa Xaa 










560 








565 




570 



<210> 64 
<211> 168 
<21*2> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 402231 . 6 . orf 3 : 2000MAY19 
<400> 64 



Ala 


Leu 


Phe 


Ser 


Arg 


He 


He 


Gin Gin Leu 


Val 


Asn Gly He He 


1 








5 






10 




15 


Thr 


Pro 


Ala 


Thr 


He 


Pro 


Ser 


Leu Gly Pro 


Trp 


Gly Val Leu His 










20 






25 




30 


Ser 


Asn 


Pro 


Met 


Asp 


Tyr 


Ala 


Trp Gly Ala 


Asn 


Gly Leu Asp Ala 










35 






40 




45 


lie 


He 


Thr 


Gin 


Leu 


Leu 


Asn 


Gin Phe Glu 


Asn 


Thr Gly Pro Pro 










50 






55 




60 


Pro 


Ala 


Asp 


Lys 


Glu 


Lys 


He 


Gin Ala Leu 


Pro 


Thr Val Pro Val 








65 




70 




75 


Thr 


Glu 


Glu 


His 


Val 


Gly 


Ser 


Gly Leu Glu 


Cys 


Pro Val Cys Lys 










80 






85 




90 


Asp 


Asp 


Tyr 


Ala 


Leu 


Gly 


Glu 


Arg Val Arg 


Gin 


Leu Pro Cys Asn 










95 






100 




105 


His 


Leu 


Phe 


His 


Thr 


Thr 


Tyr 


Glu Gin Ala 


Trp 


Leu Glu Gin His 










110 






115 


120 


Asp 


Ser 


Cys 


Pro 


Val 


Cys 


Arg 


Lys Ser Leu 


Thr 


Gly Gin Asn Thr 










125 






13 0 




135 


Ala 


Thr 


Asn 


Pro 


Pro 


Gly 


Leu 


Thr Gly Val 


Ser 


Phe Ser Ser Ser 










140 






145 




150 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser Pro Ser 


Asn 


Glu Asn Ala Thr 
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155 160 165 

Ser Asn Ser 



<210> 65 

<211> 246 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 1076157 . 1 . orf 3 : 2000MAY19 
<220> 

<221> unsure 
<222> 240 

<223> unknown or other 
<400> 65 



Pro 


Lys 


Gin 


Gly He 




Val 

Vul 


Ai y 


Ser 




ni y 


UlO XT J- W 


V3l U. 




1 






5 










10 








15 


Phe 


Leu 




Tip 




ax y 


Jtr X 


XrX U 




XJtrU. 


OCX XJtiU 


Cqv 
oei 


Pro 








20 








25 








30 


lie 


Leu 


Leu 


TH/T" r T r hT" 








Php 
rile 






Pto \7a 1 

£ J- v> val 


2X1 a 

rild. 


Jrlie 








35 










40 






45 


Lys 


Asp 


Val 


Ala Val 


Sen 


IT 1 Its 


1111 


fil n 

VjJ.ll 




Hi 11 


lip nia 


Leu 


Leu 








50 










55 






60 


Asp 


JL JL fc= 


C or 
O CI 


oi.il niy 




_ 

Lieu 


xyr 


ax y 


Pin 


Val 


JXlfcsL LlcU 


GlU 


Thr 








65 










70 








75 


Phe 


Arg 


Asn 


Leu Thr 


Ser 


He 


Gly 


Lys 


Lys 


Trp 


Lys Asp 


Gin 


Asn 








80 










85 








90 


He 


Glu 


Tyr 


Glu Tyr 


Gin 


Asn 


Pro 


Arg 


Arg 


Asn 


Phe Arg 


Ser 


Leu 








95 










100 








105 


He 


Glu 


Gly 


Asn Val 


Asn 


Glu 


He 


Lys 


Glu 


Asp 


Ser His 


Cys 


Gly 








110 










115 








120 


Glu 


Thr 


Phe 


Thr Gin 


Val 


Pro 


Asp 


Asp 


Arg 


Leu 


Asn Phe 


Gin 


Glu 








125 






130 








135 


Lys 


Lys 


Ala 


Ser Pro 


Glu 


Ala 


Lys 


Ser 


Cys 


Asp 


Asn Phe 


Val 


Cys 








140 










145 








150 


Gly 


Glu 


Val 


Gly He 


Gly 


Asn 


Ser 


Ser 


Phe 


Asn 


Met Asn 


He 


Arg 








155 










160 








165 


Gly 


Asp 


He 


Gly His 


Lys 


Ala 


Tyr 


Glu 


Tyr 


Gin 


Asp Tyr 


Ala 


Pro 








170 










175 








180 


Lys 


Pro 


Tyr 


Lys Cys 


Gin 


Gin 


Pro 


Lys 


Lys 


Ala 


Phe Arg 


Tyr 


His 








185 










190 








195 


Pro 


Ser 


Phe 


Arg Thr 


Gin 


Glu 


Arg 


Asn 


His 


Thr 


Gly Glu 


Lys 


Pro 








200 










205 








210 


Tyr 


Ala 


Cys 


Lys Glu 


Cys 


Gly 


Lys 


Thr 


Phe 


He 


Ser His 


Ser Gly 








215 










220 








225 


He 


Arg 


Arg 


Arg Met 


Val 


Met 


His 


Ser 


Gly 


Asp 


Gly Pro 


Leu 


Xaa 








230 










235 








240 


Val 


Ser 


Phe 


Val Gly 


Lys 



















245 

<210> 66 
<211> 120 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 1083142 . 1 . orf3 : 2000MAY19 
<220> 

<221> unsure 
<222> 1 

<223> unknown or other 
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<400> 66 



Xaa 


Phe 


Pro 


Val 


Leu 


Glu 


Pro 


His 


Gin 


Val 


Gly 


Leu He Arg 


Ser 


1 








5 










10 




15 


Tyr 


Asn 


Ser 


Lys 


Thr 


Met 


Thr 


cys 


Phe 


Gin 


Glu 


Leu Val Thr 


Phe 










20 










25 






30 


Arg 


Asp 


Val 


Ala 


He 


Asp 


Phe 


Ser 


Arg 


Gin 


Glu 


Trp Glu Tyr Leu 










35 










40 






45 


Asp 


Pro 


Asn 


Gin 


Arg 


Asp 


Leu 


Tyr 


Arg 


Asp 


Val 


Met Leu Glu 


Asn 










50 










55 






60 


Tyr 


Arg 


Asn 


Leu 


Val 


Ser 


Leu 


Gly 


Gly 


His 


Ser 


He Ser Lys 


Pro 










65 










70 






75 


Val 


Val 


Val 


Asp 


Leu 


Leu 


Glu 


Arg 


Gly 


Lys 


Glu 


Pro Trp Met 


He 










80 










85 






90 


Leu 


Arg 


Glu 


Glu 


Thr 


Gin 


Phe 


Thr 


Asp 


Leu 


Asp 


Leu Gin Cys 


Glu 










95 










100 






105 


He 


He 


Ser 


Tyr 


He 


Glu 


Val 


Pro 


Thr 


Tyr 


Glu 


Thr Asp He 


Ser 



110 115 120 



<210> 67 

<211> 122 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 1083264 . 1 . orf 2 : 2000MAY19 
<400> 67 



Lys 


Lys 


Ser 


Gin 


Lys 


Glu Ser 


Thr Gin Gin 


Thr 


Arg 


He 


His 


Phe 


1 








5 




10 










15 


Gin 


Arg 


Asp 


He 


Leu 


Cys Lys 


Glu Ala Thr 


Trp 


Lys 


Arg Lys 


Glu 










20 




25 










30 


Lys 


Lys 


Ser 


Gly 


Met 


Ala Leu 


Thr Gin Gly 


Pro 


Leu 


Lys 


Phe 


Met 










35 




40 










45 


Asp 


Val 


Ala 


He 


GlU 


Phe Ser 


Gin Glu Glu 


Trp 


Lys 


Cys 


Leu Asp 










50 




55 










60 


Pro 


Ala 


Gin Arg 


Thr 


Leu Tyr 


Arg Asp Val 


Met 


Leu 


Glu 


Asn 


Tyr 










65 




70 










75 


Arg 


Asn 


Leu 


Val 


Ser 


Leu Gly 


He Cys Leu 


Pro 


Asp 


Leu 


Ser 


Val 










80 


85 










90 


Thr 


Ser 


Met 


Leu 


Glu 


Gin Lys 


Arg Asp Pro 


Trp 


Thr 


Leu 


Gin 


Ser 










95 




100 










105 


Glu 


Glu 


Lys 


He 


Ala 


Asn Asp 


Pro Asp Gly 


Arg 


Glu Cys 


He 


Gin 










110 




115 










120 


Lys 


Val 





















<210> 68 
<211> 428 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc__feature 

<223> Incyte ID No: LG : 350793 . 2 . orf 3 : 2000MAY19 
<400> 68 



Ala Gin Gly Ser 


Ser 


Trp Lys Leu Pro 


Phe 


Glu 


Arg Leu Ala Phe 


1 








5 


10 




15 


Val 


Leu 


Ser 


Ser 


Asn 


Ser Leu Lys His 


Cys 


Thr 


Glu Leu Glu Leu 










20 


25 




30 


Phe 


Lys 


Ala 


Thr 


Cys 


Arg Trp Leu Arg Leu Glu 


Glu Pro Arg Met 










35 




40 




45 


Asp 


Phe 


Ala 


Ala 


Lys 


Leu Met Lys Asn 


He 


Arg 


Phe Pro Leu Met 










50 


55 




60 


Thr 


Pro 


Gin 


Glu 


Leu 


He Asn Tyr Val 


Gin 


Thr 


Val Asp Phe Met 
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65 










70 






75 


Arg 


Thr 


Asp Asn 


Thr 


Cys 


Val 


Asn 


Leu 


Leu 


Leu 


Glu 


Ala Ser Asn 








80 








85 






90 


Tyr 


Gin 


Met Met 


Pro 


Tyr 


Met 


Gin 


Pro 


Val 


Met 


Gin 


Ser Asp Arg 








95 








100 






105 


Thr 


Ala 


He Arg 


Ser 


Asp 


Thr 


Thr His 


Leu 


Val 


Thr 


Leu Gly Gly 








110 










115 






120 


Val 


Leu 


Arg Gin 


Gin 


Leu 


Val 


Val 


Ser 


Lys 


Glu 


Leu 


Arg Met Tyr 








125 










130 






135 


Asp 


Glu 


Lys Ala 


His 


Glu 


Trp 


Lys 


Ser 


Leu 


Ala 


Pro 


Met Asp Ala 








140 








145 






150 


Pro 


Arg 


Tyr Gin 


His 


Gly 


He 


Ala Val 


He 


Gly 


Asn 


Phe Leu Tyr 








155 








160 






165 


Val 


Val 


Gly Gly 


Gin 


Ser 


Asn 


Tyr Asp 


Thr 


Lys 


Gly 


Lys Thr Ala 








170 










175 






180 


Val 


Asp 


Thr Val 


Phe 


Arg 


Phe 


Asp 


Pro 


Arg 


Tyr 


Asn 


Lys Trp Met 








185 










190 






195 


Gin 


Val 


Ala Ser 


Leu 


Asn 


Glu 


Lys 


Arg 


Thr 


Phe 


Phe 


His Leu Ser 








200 










205 






210 


Ala 


Leu 


Lys Gly 


Tyr 


Leu 


Tyr 


Ala 


Val 


Gly 


Gly 


Arg 


Asn Ala Ala 








215 










220 






225 


Gly 


GlU 


Leu Pro 


Thr 


Val 


Glu 


Cys 


Tyr 


Asn 


Pro 


Arg 


Thr Asn Glu 








230 










235 






240 


Trp 


Thr 


Tyr Val 


Ala 


Lys 


Met 


Ser 


Glu 


Pro 


His 


Tyr 


Gly His Ala 








245 










250 






255 


Gly 


Thr 


Val Tyr 


Gly 


Gly 


Val 


Met 


Tyr 


He 


Ser 


Gly 


Gly He Thr 








260 










265 






270 


His 


Asp 


Thr Phe 


Gin 


Lys 


Glu 


Leu 


Met 


Cys 


Phe 


Asp 


Pro Asp Thr 








275 










280 






285 


Asp 


Lys 


Trp He 


Gin 


Lys 


Ala 


Pro 


Met 


Thr 


Thr 


Val 


Arg Gly Leu 








290 










295 






300 


His 


Cys 


Met Cys 


Thr 


Val 


Gly 


Glu Arg 


Leu 


Tyr 


Val 


He Gly Gly 








305 










310 






315 


Asn 


His 


Phe Arg 


Gly 


Thr 


Ser 


Asp Tyr 


Asp 


Asp 


Val 


Leu Ser Cys 








320 










325 






330 


Glu 


Tyr 


Tyr Ser 


Pro 


He 


Leu 


Asp Gin 


Trp 


Thr 


Pro 


He Ala Ala 








335 










340 






345 


Met 


Leu 


Arg Gly 


Gin 


Ser 


Asp 


Val 


Gly 


Val 


Ala 


Val 


Phe Glu Asn 








350 










355 






360 


Lys 


He 


Tyr Val 


Val 


Gly 


Gly 


Tyr 


Ser 


Trp 


Asn 


Asn 


Arg Cys Met 








365 










370 






375 


Val 


Glu 


He Val 


Gin 


Lvs 


Tvr 


Asp 


Pro 


Asp 


Lvs 


Asp 


Glu Trp His 








380 










385 






390 


Lys 


Val 


Phe Asp 


Leu 


Pro 


Glu 


Ser 


Leu 


Gly 


Gly 


He 


Arg Ala Cys 








395 










400 






405 


Thr 


Leu 


Thr Val 


Phe 


Pro 


Pro 


Glu 


Glu 


Thr 


Thr 


Pro 


Ser Pro Ser 








410 










415 






420 


Arg 


Glu 


Ser Pro 


Leu 


Ser 


Ala 


Pro 


















425 



















<210> 69 
<211> 307 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: LG : 408751 . 3 . orf 2 : 2000MAY19 
<400> 69 

Arg Asp Pro Gly Trp Gin He Arg Asp Arg Ala Gly Leu Ala Trp 
15 10 15 

Asn Met Leu Ala Asn Ser Ala Ser Val Arg He Leu He Lys Gly 

20 25 30 

Gly Lys Val Val Asn Asp Asp Cys Thr His Glu Ala Asp Val Tyr 

35 40 45 

He Glu Asn Gly He He Gin Gin Val Gly Arg Glu Leu Met He 
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50 








55 




60 


Pro 


Gly 


Gly 


Ala Lys 


Val 


He 


Asp Ala 


Thr 


Gly 


Lys Leu Val He 








65 








70 




75 


Pro 


Gly 


Gly 


He Asp 


Thr 


Ser 


Thr His 


Phe 


His 


Gin Thr Phe Met 






80 








85 




90 


Asn 


Ala 


Thr 


Cys Val 


Asp 


Asp 


Phe Tyr 


His 


Gly 


Thr Lys Ala Ala 








95 








100 




105 


Leu 


Val 


Gly 


Gly Thr 


Thr 


Met 


He He 


Gly 


His 


Val Leu Pro Asp 








110 








115 




120 


Lys 


Glu 


Thr 


Ser Leu 


Val 


Asp 


Ala Tyr 


Glu 


Lys 


Cys Arg Gly Leu 








125 








13 0 




135 


Ala 


Asp 


Pro 


Lys Val 


Cys 


Cys 


Asp Tyr 


Ala 


Leu 


His Val Gly He 








140 








145 




150 


Thr 


Trp 


Trp 


Ala Pro 


Lys 


Val 


Lys Ala 


Glu 


Met 


Glu Thr Leu Val 








155 




160 




165 


Arg 


Glu 


Lys 


Gly Val 


Asn 


Ser 


Phe Gin 


Met 


Phe 


Met Thr Tyr Lys 








170 








175 




180 


Asp 


Leu 


Tyr 


Met Leu 


Arg 


Asp 


Ser Glu 


Leu 


Tyr 


Gin Val Leu His 








185 








190 




195 


Ala 


Cys 


Lys 


Asp He 


Gly 


Ala 


He Ala 


Arg 


Val 


His Ala Glu Asn 








200 








205 




210 


Gly 


Glu 


Leu 


Val Ala 


Glu 


Gly 


Ala Lys 


Glu 


Ala 


Leu Asp Leu Gly 








215 








220 




225 


He 


Thr 


Gly 


Pro Glu 


Gly 


He 


Glu lie 


Ser 


Arg 


Pro Glu Glu Leu 






23 0 






235 




240 


Glu 


Ala 


GlU 


Ala Thr 


His 


Arg 


Val He 


Thr 


Arg 


Asp Gly Gly Asn 








245 








250 




255 


His 


Asp 


Ala 


Ala Ser 


Trp 


Cys 


Ser Ala 


His 


His 


Leu Tyr Pro Cys 








260 








265 




270 


Gin 


Pro 


Ser 


Leu Gly 


His 


Gly 


Pro Trp 


Ala 


Asp 


Val Lys Glu Pro 








275 








280 




285 


Ser 


Ser 


Ser 


Gly Gly 


Gly 


Gin 


Leu Gly 


Arg 


Ala 


Ser Leu Leu Gly 








290 








295 




300 


Leu 


Gly 


Lys 


Leu Tyr 


Leu 


Leu 











305 

<210> 70 

<211> 198 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 33 6120 . 1 . orf 1 : 2000MAY01 
<400> 70 



He 


He 


Pro 


Gin 


Arg 


Ser 


Asn 


Gly Asp Arg 


Trp 


Gly Arg Ser Leu 


1 








5 






10 




15 


Leu 


Pro 


Ser 


Arg 


Thr 


Phe 


Leu 


Gin Ala Leu 


Asn 


Leu Gly He Glu 








20 






25 




30 


Val 


He 


Asn 


Thr 


Thr 


Asp 


Tyr 


Leu His Phe 


Ser 


Lys Glu Cys Ser 










35 






40 




45 


Arg 


Ala 


Leu 


Leu 


Lys 


Met 


Gin 


Tyr Cys Pro 


His 


Cys Gin Gly Leu 










50 






55 




60 


Ala 


Leu 


Thr 


Lys 


Pro 


Cys 


Met 


Gly Tyr Cys 


Leu 


Asn Val Met Arg 










65 






70 




75 


Gly 


Cys 


Leu 


Ala 


His 


Met 


Ala 


Glu Leu Asn 


Pro 


His Trp His Ala 










80 






85 




90 


Tyr 


He 


Arg 


Ser 


Leu 


Glu 


Glu 


Leu Ser Asp 


Ala 


Met His Gly Thr 










95 






100 




105 


Tyr 


Asp 


He 


Gly 


His 


Val 


Leu 


Leu Asn Phe 


His 


Leu Leu Val Asn 








110 






115 




120 


Asp 


Ala 


Val 


Leu 


Gin 


Ala 


His 


Leu Asn Gly 


Gin 


Lys Leu Leu Glu 










125 






13 0 




135 


Gin 


Val 


Asn 


Arg 


He 


Cys 


Gly 


Arg Pro Val 


Arg 


Thr Pro Thr Gin 










140 






145 




150 


Ser 


Pro 


Arg 


Cys 


Ser 


Phe 


Asp 


Gin Ser Lys 


Glu 


Lys His Gly Met 
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155 160 165 

Lys Thr Thr Thr Arg Asn Ser Glu Glu Thr Leu Ala Asn Arg Arg 

170 175 180 

Lys Glu Phe He Asn Ser Leu Ser Thr Val Gin Val He Leu Trp 

185 190 195 

Arg Ser Ser 



<210> 71 

<211> 227 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc__feature 

<223> Incyte ID No: LI : 23 4104 . 2 . orf 1 : 2000MAY01 
<400> 71 



7\ "1 -a 

Ala 


Thr 


Pro 


ber tjiy 


Arg 


Pro 


pi« 
urn 


Ser 


Trp 


Thr 


Arg riie ber Lieu 


JL 






c 










1U 




lb 


Trp 


Arg 


c^iy 


Fro Arg 


Arg 


Thr 


Arg 


Pro 


Ser 


Pro 


Fro Aia Fro Aia 


















zb 




J u 


Pro 


Aid 


Cj ly 


Men <^iy 


Ser 




nlS 


Asp 


ijiy 


Arg 


ber vjj.y Fro vai 








J o 










a n 




A C 


beu 


Thr 


Pro 


Ala Asp 


Thr 


Lieu 


HIS 


Pro 


Pro 


Thr 


Arg Leu Gin Pro 






















oil 


Ser 


Pro 


Pro 


Asp Thr 


His 


Pro 


Gly 


Gly 


Ser 


Ser 


Leu Pro Ala Pro 








65 










70 




75 


Arg 


Pro 


Ala 


Leu Ser 


Cys 


Trp 


Ala 


Arg 


Val 


Phe 


Ala Ser Leu Val 








80 








85 




90 


Arg 


Pro 


Ala 


Gly Phe 


Pro 


Gly 


Gly 


Thr 


His 


Gly 


Ala Pro Gly Met 








95 










100 




105 


Pro 


Leu 


Gly 


Ser Pro 


Ser 


Thr 


Ser 


Thr 


Ala 


Gin 


Trp Pro Tyr Val 






110 










115 




120 


Gin 


Leu 


Val 


Pro Gly 


Pro 


Arg 


Val 


Arg 


Lys 


Thr 


Ala Ser Arg Ser 








125 










13 0 




135 


His 


Cys 


Gin 


Glu Arg 


Ala 


Glu 


Glu 


Trp 


Ser 


Gly 


Pro Arg Arg Pro 








140 










145 




150 


Trp 


Gly 


Glu 


Gly Asp 


Pro 


Gly 


Pro 


Val 


Thr 


Ala 


Thr Pro Gly Thr 








155 










160 




165 


Pro 


Gly 


Gly 


Ala Pro 


Thr 


Ser 


Ala 


Phe 


Ser 


Cys 


Ala Ala Lys Leu 








170 










175 




180 


Gin 


Lys 


Pro 


Asp Ala 


Gly 


Leu 


Val 


Val 


Ala 


Asn 


Gly Thr Met Cys 








185 










190 




195 


Cys 


Pro 


Ala 


Lys His 


Thr 


Trp 


Arg 


Ser 


Gly 


Pro 


Lys lie Pro lie 








* 200 










205 




210 


Leu 


Asp 


Phe 


His Pro 


Ala 


Pro 


Ser 


Ser 


Thr 


Pro 


Arg Ser Ala Leu 








215 










220 




225 



Ser His 



<210> 72 

<211> 122 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 450887 . 1 . orf 3 : 2000MAY01 
<400> 72 

Ser Val His Phe Ser Arg Lys Gly Phe Val Leu Met Ala Pro Pro 
15 10 15 

Gin Pro Lys Ser Gly Leu Phe Val Gly lie Asn Lys Gly His Val 

20 25 30 

Val Thr Lys Arg Glu Leu Pro Pro Arg Pro Cys His Arg Lys Gly 

35 40 45 
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Lys Ser Thr Lys Arg Val Ser Met Val Arg Gly Leu He Arg Glu 

50 55 60 
Val Ala Gly Phe Ala Pro Tyr Glu Lys Arg He Thr Glu Leu Leu 

65 70 75 
Lys Val Gly Lys Asp Lys Arg Ala Leu Lys Leu Ala Lys Arg Lys 

80 85 90 

Leu Gly Thr His Lys Arg Ala Lys Lys Lys Arg Glu Glu Met Ala 

95 100 105 

Gly Val Leu Arg Lys Met Arg Ser Ala Gly Thr His Thr Asp Lys 

110 115 120 

Lys Lys 

<210> 73 

<211> 209 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 119992 . 3 . orf 2 :2000MAY01 
<400> 73 

Cys Ser Gin He Glu Leu Ala He Glu Leu Asp Ser Thr His Leu 

1.5 10 15 

Val Thr Leu Gly Gly Val Leu Arg Gin Gin Leu Val Val Ser Lys 

20 25 * 30 

Glu Leu Arg Met Tyr Asp Glu Arg Ala Gin Glu Trp Arg Ser Leu 

35 40 45 

Ala Pro Met Asp Ala Pro Arg Tyr Gin His Gly Tyr Trp Leu Phe 

50 55 60 

He Gly Asn Phe Leu Tyr Val Val Gly Gly Gin Ser Asn Tyr Asp 

65 70 75 

Thr Lys Gly Lys Thr Ala Val Asp Thr Val Phe Arg Phe Asp Pro 

80 85 90 

Arg Tyr Asn Lys Trp Met Gin Val Ala Ser Leu Asn Glu Lys Arg 

95 100 105 

Thr Phe Phe His Leu Ser Ala Leu Lys Gly His Leu Tyr Ala Val 

110 115 120 

Gly Gly Arg Ser Ala Ala Gly Glu Leu Gly Thr Val Glu Cys Tyr 

125 130 135 

Asn Pro Arg Met Asn Glu Trp Ser Tyr Val Ala Lys Met Ser Glu 

140 145 150 

Pro His Tyr Gly His Ala Gly Thr Val Tyr Gly Gly Leu Met Tyr 

155 160 165 
He Ser Gly Gly He Thr His Asp Thr Phe 'Gin Asn Glu Leu Met 

170 175 180 

Cys Phe Asp Pro Asp Thr Asp Lys Trp Met Gin Lys Ala Pro Met 

185 190 195 
Thr Thr Val Arg Gly Leu His Cys Met Cys Thr Arg Trp Arg 
200 205 

<210> 74 
<211> 312 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 197241 . 2 .orf 1 :2000MAY01 
<400> 74 

Tyr Ser Arg He Leu He Leu Gin Met Phe He Leu Gly Ala He 

15 10 15 

He Gin He Leu Pro Trp Val Met Ala Ser Gin Asn Ser Lys His 

20 25 30 
His Pro Glu Leu Val Asp Leu Phe Ser Arg Ser Gly He Tyr He 



53/69 



WO 01/62922 



PCT/US01/05896 









35 




40 


45 


Lys 


Gill 


Val Val 


Leu Cys 


Lys 


Phe His Ser Val 


Phe Leu Ser Gin 






50 




55 


60 


Lys 


Gly 


Gin Val 


Tyr Thr 


Cys 


Gly His Gly Pro 


Gly Arg Ala He 






65 




70 


75 


Arg 


Asp 


Met Gly 


Asp Glu Gin 


Thr Cys Leu Val 


Pro Arg Leu Val 






80 




85 


90 


Glu 


Gly 


Leu Asn 


Gly His Asn 


Cys ser Gin Val 


Ala Ala Ala Lys 






95 




100 


105 


Asp 


His 


Thr Val 


Val Leu 


Thr 


Glu Asp Gly Cys 


Val Tyr Thr Phe 








110 




115 


120 


Gly 


Leu 


Asn He 


Phe His 


Gin 


Leu Gly lie lie 


Pro Pro Pro Ser 






125 




130 


135 


Ser 


Cys 


Asn Val 


Pro Arg Gin 


He Gin Ala Lys 


Tyr Leu Lys Gly 






140 




145 


150 


Arg 


Thr 


He He 


Gly Val 


Ala 


Ala Gly Arg Phe 


His Thr Val Leu 








155 




160 


165 


Trp 


Thr 


Arg Glu 


Ala Val 


Tvr 


Thr Met Gly Leu 


His Gly Gly Gin 








170 




175 


180 


Leu 


Gly 


Cys Leu 


Leu Asp 


Pro 


Asn Gly Glu Lys 


Cys Val Thr Ala 








185 




190 


195 


Pro 


Arg 


Gin Val 


Ser Ala 


Leu 


His His Lys Asp 


He Ala Leu Ser 








200 




205 


210 


Leu 


Val 


Ala Ala 


Ser Asp 


Gly 


Ala Thr Val Cys 


Val Thr Thr Arg 








215 


220 


225 


Gly 


Asp 


He Tyr 


Leu Leu 


Ala 


Asp Tyr Gin Cys 


Lys Lys Met Ala 








230 




235 


240 


Ser 


Lys 


Gin Leu 


Asn Leu 


Lys 


Lys Val Leu Val 


Ser Gly Gly His 








245 




250 


255 


Met 


Glu 


Tyr Lys 


Val Asp 


Pro 


Glu His Leu Lys 


Glu Asn Gly Gly 








260 




265 


270 


Gin 


Lys 


He Cys 


He Leu 


Ala 


Met Asp Gly Ala 


Gly Arg Val Phe 








275 




280 


285 


Cys 


Trp 


Arg Ser 


Val Asn 


Ser 


Ser Leu Lys Gin 


Cys Arg Leu Gly 








290 




295 


300 


Leu 


Ser 


Thr Ser 


Gly Ser 


Ser 


Phe Leu He Trp 


Leu 








305 




310 • 





<210> 75 

<211> 190 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 406860 . 20 .or f 3 : 2000MAY01 
<400> 75 



Leu 


Tyr 


Val 


Met 


Leu 


Glu 


Met 


Thr Arg 


Pro 


Ser 


Ser 


Leu 


Ser Leu 


1 






5 








10 








15 


Ser 


Gin 


Leu 


Ala 


Leu 


Phe 


Ser 


Arg Ala 


Val 


Leu 


Pro 


Val 


Gly Arg 










20 








25 








30 


Ala 


Glu 


Asp 


Leu 


Ala 


Gly 


Glu 


Ala Gly 


Glu 


Ala 


Cys 


Trp 


Pro Ser 










35 








40 








45 


Leu 


Cys 


Ala 


Pro 


Leu 


His 


Ala 


His Pro 


Pro 


Ala 


Pro 


Pro 


Glu Arg 








50 








55 








60 


He 


Val 


His 


Pro 


Ala 


Ala 


Arg 


Ser Leu 


Asp 


Leu 


His 


Phe Gly Ala 










65 








70 








75 


Pro 


Gly 


Arg 


Val 


Glu 


Leu 


Arg 


Cys Glu 


Val 


Ala 


Pro Ala Gly Ser 










80 








85 








90 


Gin 


Val 


Arg 


Trp 


Tyr 


Lys 


Asp 


Gly Leu 


Glu 


Val 


Glu 


Ala 


Ser Asp 








95 






100 








105 


Ala 


Leu 


Gin 


Leu 


Gly Ala 


Glu 


Gly Pro 


Thr Arg 


Thr 


Leu 


Thr Leu 










110 








115 








120 


Pro 


His 


Ala 


Gin 


Pro 


Glu 


Asp 


Ala Gly 


Glu Tyr 


Val 


Cys 


Glu Thr 










125 








13 0 








135 


Arg 


His 


Glu 


Ala 


He 


Thr 


Phe 


Asn Val 


He 


Leu 


Ala 


Glu 


Pro Pro 
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140 145 150 

Val Gin Phe Leu Ala Leu Glu Thr Thr Pro Ser Pro Leu Cys Val 

155 160 165 

Gly Pro Gly Glu Pro Val Val Gin Glu Gly Glu Gly Leu Glu Leu 

170 175 180 

His Ala Glu Gly Pro Ala Glu Ser Leu His 

185 190 

<210> 76 

<211> 295 . 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 142384 . 1 .orf 3 : 2000MAY01 
<400> 76 



Arg 


Thr 


Cys Cys 


Arg Val 


Val 


Pro Glu 


Ala 


Lys 


Gin Arg Trp Arg 


i 




5 








10 




15 


ax y 


Val 

val 


Arg Leu 


Arg Arg Arg 


Gin Arg Arg 


Ala 


Pro Glv Aror Arcr 








20 








25 




30 


Ala 


Pro 


Gly Arg 


Ala 


Ala 


Leu 


Leu Val 


Leu 


Leu 


Ala Leu Ala Ala 






35 








40 




45 


Ala 


Ala 


Ala Gly 


Ser 


Gly Arg 


Leu Ser 


Cys 


Arg 


Met Cys Gly Arg 








50 








55 




60 


Arg 


Arg 


Arg Ser 


Val 


Gly Gly 


Ala Gly Gly 


Pro 


Gly Ser Gly Leu 








65 








70 




75 


Ala 


Pro 


Leu Pro 


Gly Leu 


Pro 


Pro Ser 


Ala 


Ala 


Ala His Gly Ala 








80 








85 




90 


Ala 


Leu 


Leu Ser 


His 


Trp 


Asp 


Pro Thr 


Leu 


Ser 


Ser Asp Trp Asp 








95 




100 




105 


Gly 


Glu 


Arg Thr 


Ala 
110 


Pro 


Gin 


Cys Leu 


Leu 
115 


Arg 


He Lys Arg Asp 
120 


lie 


Met 


Ser He 


Tyr 


Lys 


GlU 


Pro Pro 


Pro 


Gly 


Met Phe Val Val 








125 






130 




13 5 


Pro 


Asp 


Thr Val 


Asp Met Thr 


Lys He 


His 


Ala 


Leu He Thr Gly 






140 






145 




150 


Pro 


Phe 


Asp Thr 


Pro 


Tyr 


Glu 


Gly Gly Phe 


Phe 


Leu Phe Val Phe 






155 








160 




165 


Arg 


Cys 


Pro Pro 


Asp 
170 


Tyr 


Pro 


He His 


Pro 
175 


Pro 


Arg Val Lys Leu 
180 


Met 


Thr 


Thr Gly 


Asn 


Asn 


Thr 


Val Arg 


Phe 


Asn 


Pro Asn Phe Tyr 






185 








190 




195 


Arg 


Asn 


Gly Lys 


Val 


Cys 


Leu 


Ser He 


Leu 


Gly 


Thr Trp Thr Gly 




Ala 




200 








205 




210 


Pro 


Trp Ser 


Pro 


Ala 


Gin 


Ser He 


Ser 


Ser 


Val Leu lie Ser 






215 








220 




225 


He 


Gin 


Ser Leu 


Met 
230 


Thr 


GlU 


Asn Pro 


Tyr 
235 


His 


Asn Glu Pro Gly 
240 


Phe 


Glu 


Gin Glu 


Arg 
245 


His 


Pro 


Gly Asp 


Ser 
250 


Lys 


Asn Tyr Asn Glu 
255 


Cys 


lie 


Arg His 


Glu 
260 


Thr 


He 


Arg Val 


Ala 
265 


Val 


Cys Asp Met Met 
270 


Glu 


Gly 


Lys Cys 


Pro 
275 


Cys 


Pro 


Glu Pro 


Leu 
280 


Arg 


Gly Val Met Glu 
285 


Lys 


Ser 


Phe Leu 


Glu 
290 


Tyr 


Tyr 


Asp Phe 


Tyr 
295 







<210> 77 

<211> 288 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 895427 . 1 . orf 2 : 2000MAY01 
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<400> 77 



Ala 


Pro 


Arg 


Leu 


Trp 


1 








5 


Ser 


Gly 


Pro 


Pro 


Ala 








20 


Gly 


Gin 


He 


Glu 


Arg 
35 


Leu 


Cys 


Gly 


Ser 


Ala 
50 


Glu 


Arg 


Asp 


Ser 


Ala 
65 


Asp 


lie 


Asn 


Val 


He 
80 


Leu 


Pro 


Thr 


Pro 


Leu 
95 


Glu 


Ala 


Met 


Ala 


Pro 
110 


Leu 


Arg 


Ala 


Pro 


Glu 
125 


Arg 


Ala 


Thr 


Leu 


Thr 
140 


Ser 


Phe 


His 


Ala 


Tyr 
155 


Cys 


Phe 


Gly 


Pro 


Val 
170 


Pro 


Arg 


Ala 


Arg 


Ser 
185 


Phe 


Lys 


His 


His 


He 
200 


Pro 


Asp 


Pro 


Arg 


Leu 
215 


Leu 


Arg 


Pro 


Lys 


Arg 
230 


Pro 


Glu 


Val 


Val 


Thr 
245 


Pro 


Pro 


Ser 


Asn 


Arg 
260 


Gly 


Leu 


Pro 


Asp 


Leu 
275 


Arg 


Pro 


Leu 







Ala 


Cys 


Pro Cys 


His 








10 


Arg 


Cys 


Pro Tyr 


He 








25 


Arg 


Gly 


Leu Arg 


Val 








40 


Ala 


Val 


Lys Lys 


Glu 








55 


Ala 


Val 


Cys Leu 


Ser 








70 


Thr 


Gly 


He Leu 


Lys 








85 


He 


Thr 


Gin Pro 


Leu 








100 


Gly 


Thr 


Pro Gin 


Thr 








115 


Gly 


Ser 


Tyr Ser 


Cys 








130 


Leu 


Leu 


Leu Asp 


His 








145 


Asn 


Arg 


Met Thr 


Pro 






160 


Leu 


Leu 


Pro Ala 


Arg 








175 


Ser 


Gly 


Pro Gly 


Leu 








190 


Glu 


Val 


Leu His 


Tyr 








205 


Pro 


Arg 


Gin Ser 


Pro 








220 


Gin 


Pro 


Pro Leu 


His 








235 


Arg 


Pro 


Arg Gly 


Arg 








250 


Tyr 


Ala 


Gly Asp 


Trp 








265 


Trp 


Ala 


Gly Phe 


Pro 








280 



Cvs 


Trp 


Trp Ser Gly 








15 


He 


Gin 


Lys 


Cys Val 








30 


Val 


Gly 


Leu 


Tyr Arg 








45 


Leu 


Arg 


Asp 


Ala Phe 








60 


Glu 


Asp 


Leu Tyr Pro 








75 


Asp 


Tyr 


Leu Arg Glu 








90 


Tyr 


Lys 


Val 


Val Leu 








105 


Glu 


Phe 


Pro 


Pro Pro 








120 


Leu 


Pro 


Asp 


Val Glu 








135 


Leu 


Arg 


Leu 


Val Ser 








150 


Gin 


Asn 


Leu 


Ala Val 








165 


Gin 


Ala 


Pro 


Thr Arg 








180 


Ala 


Ser 


Ala 


Val Asp 








195 


Leu 


Leu 


Gin 


Ser Trp 








210 


Aso 


Val 


Ala 


Pro Tyr 








225 


Leu 


Pro 


Leu Ala Asp 








240 


Gly 


Gly 


Pro 


Glu Ser 








255 


Ser 


Val 


Cys Gly Arg 








270 


Val 


Arg 


Ala Arg Leu 



285 



<210> 78 

<211> 294 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 757439 . 1 . orfl : 2000MAY01 
<400> 78 



Leu 


Ala 


Ala 


Pro 


Gin 


Ser 


His 


Ser He 


Pro 


Ser 


Pro 


Pro Gly Ala 


1 








5 








10 






15 


His 


Leu 


Leu 


Lys 


Thr 


Arg 


Val 


Leu Pro 


Ser 


Ala 


Arg 


Arg Ala Arg 










20 








25 






30 


Ala 


Arg 


Gly 


Ala 


Arg 


Glu 


Leu 


Arg Ser 


Ala 


Arg 


Ala 


Met Gly Pro 










35 








40 






45 


Pro 


Pro 


Gly 


Ala 


Gly 


Val 


Ser 


Cys Arg 


Gly 


Gly 


Cys 


Gly Phe Ser 










50 








55 






60 


Arg 


Leu 


Leu 


Ala 


Trp 


Cys 


Phe 


Leu Leu 


Ala 


Leu 


Ser 


Pro Gin Ala 










65 






70 






75 


Pro 


Gly 


Ser 


Arg 


Gly 


Ala 


Glu 


Ala Val 


Trp 


Thr 


Ala 


Tyr Leu Asn 










80 








85 






90 


Val 


Ser 


Trp 


Arg 


Val 


Pro 


His 


Thr Gly 


Val 


Asn 


Arg 


Thr Val Trp 










95 








100 






105 


Glu 


Leu 


ser 


Glu 


Glu 


Gly 


Val 


Tyr Gly 


Pro 


Asp 


Ser 


Pro Leu Glu 
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110 










115 




120 


Pro 


Val 


Ala Gly 


Val 


Leu 


Val 


Pro 


Pro 


Asp 


Gly 


Pro Gly Ala Leu 






125 










130 




135 


Asn 


Ala 


Cys Asn 


Pro 


His 


Thr 


Asn 


Phe 


Thr 


Val 


Pro Thr Val Trp 






140 










145 




150 


Gly 


Ser 


Thr Val 


Gin 


Val 


Ser 


Trp 


Leu 


Gly 


Leu 


He Gin Arg Gly 






155 










160 




165 


Gly 


Gly 


Cys Thr 


Phe 


Ala 


Asp 


Lys 


He 


His 


Leu 


Ala Tyr Glu Arg 








170 










175 




180 


Gly 


Ala 


Ser Gly 


Ala 


Val 


He 


Phe 


Asn 


Phe 


Pro 


Gly Thr Arg Asn 




185 










190 




195 


Glu 


Val 


He Pro 


Met 


Ser 


His 


Pro 


Gly 


Ala 


Val 


Asp He Val Ala 








200 










205 




210 


He 


Met 


He Arg 


Gin 


Ser 


Glu 


Arg 


His 


Lys 


Asn 


Ser Ala He Tyr 








215 










220 




225 


Ser 


Lys 


Arg His 


Thr 


Ser 


Asp 


Asn 


Gly 


His 


Arg 


Ser Arg Glu Lys 








230 










235 




240 


Thr 


Tip 


Pro Leu 


Gly 


Glu 


Ser 


Leu 


Phe 


Asn 


Phe 


Phe Arg Phe Leu 








245 










250 




255 


Cys 


Pro 


Phe Leu 


Leu 


Leu 


Arg 


Arg 


Ala 


Thr 


Val 


Gly Tyr Phe He 








260 










265 




270 


Phe 


Tyr 


Ser Ala 


Arg 


Arg 


Leu 


Arg 


Asn 


Ala 


Arg 


Ala Gin Ser Arg 








275 










280 




285 


Lys 


Gin 


Arg Pro 


He 


Lys 


Gly 


Arg 


Cys 









290 

<210> 79 

<211> 196 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 1144066 . 1 . orf 3 : 2000MAY01 
<400> 79 



Gly 


Ala 


Thr 


Pro Arg 


Ala 


Gly 


Glu 


Arg 


Ala 


Pro 


Leu Leu Pro Asp 


1 






5 








10 




15 


Arg 


Ala 


Ala 


His Ala 


Ala 


Ser 


Gly 


Thr 


He 


Thr 


Val Ala Gly Arg 








20 










25 




30 


Arg 


Pro 


Val 


Gin He 


Leu 


Ser 


Glu 


Phe 


Phe 


Gly 


Ala Phe Ser Pro 






35 










40 


45. 


Arg 


Lys 


Leu 


Ala He 


Gin 


Lys 


Cys 


Ala 


Ser 


Arg 


Thr Ala Ala Ala 








50 










55 




60 


Met 


Gly 


Ser 


Glu Asp 


His 


Gly 


Ala 


Gin 


Lys 


Pro 


Ser Cys Lys He 








65 










70 




75 


•Met 


Thr 


Phe 


Arg Pro 


Thr 


Met 


Gly 


Glu 


Phe 


Lys 


Asp Phe Asn Lys 








80 










85 




90 


Tyr 


Val 


Gly 


Tyr He 


Glu 


Ser 


Gin 


Gly 


Ala 


His 


Arg Ala Gly Leu 








95 










100 




105 


Gly 


Lys 


He 


He Pro 


Pro 


Lys 


Glu 


Trp 


Lys 


Pro 


Arg Gin Thr Tyr 








110 










115 




' 120 


Asp 


Asp 


He 


Asp Asp 


Val 


Val 


He 


Pro 


Gly 


Pro 


He Gin Gin Val 








125 










130 




135 


Val 


Thr 


Gly 


Gin Ser 


Gly 


Leu 


Phe 


Thr 


Gin 


Tyr 


Asn He Gin Lys 








140 








145 




150 


Lys 


Gly 


Met 


Thr Val 


Gly 


Glu 


Tyr 


Arg 


Arg 


Leu 


Gly Asn Ser Glu 








155 








160 




165 


Lys 


Tyr 


Cys 


Thr Pro 


Arg 


Asp 


Gin 


Asp 


Phe 


Asp 


Asp Leu Glu Arg 








170 










175 




180 


Lys 


Tyr 


Trp 


Glu Gly 


Thr 


Leu 


Thr 


Leu 


Cys 


Leu 


Pro Asp Leu Arg 



185 190 195 

Gly 
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<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 243 660 . 4 . orf 3 :2000MAY01 



<400> 80 

Glu Gly Trp Thr Gin Pro Gin Gin Ala Gly Glu Gly Pro His Pro 
15 10 15 

Ala Ala His Glu Cys Leu His Asp Leu Gin Gin Ala Ala Pro Gly 
20 25 30 

Pro Gly Pro Pro Ala Ser Ser Gin Pro Gly Gin Pro Asp Arg Gin 
35 40 45 

Gin Asp Pro Gly Arg Val Val Val Cys Pro Gly Ala Gin Gly Glu 
50 55 60 

Ala Glu Val Pro Arg Pro Gly Leu Pro Gly Glu Gly Gly Pro Leu 
' 65 70 75 

Gin Gly Pro Pro Ser lie Gly Ser Gly Ala Thr Arg Thr Glu Arg 
80 85 90 

Ser Pro Ala Gin Arg Pro Ser Pro Arg Ser Leu Gly Leu Ala Gly 
95 100 105 

Gly His Lys Glu Thr Arg Glu Arg Ser Met Ser Glu Thr Gly Thr 

110 115 120 

Ala Ala Cys Pro Trp Val Cys Pro Arg Glu Leu Leu Ser Val Ala 

125 130 135 

Ala Gin Thr Leu Leu Ser Ser Asp Thr Lys Ala Pro Gly Ser Ser 

140 145 150 

Ser Cys Gly Ala Glu Arg Leu His Thr Val Gly Gly Pro Gly Ser 

155 160 165 

Ala Arg Pro Arg Ala Phe Ser His Ser Gly Val His Ser Leu Asp 

170 175 180 

Gly Gly Glu Val Asp Ser Gin Ala Leu Gin Glu Leu Thr Gin Met 

185 190 195 

Val Ser Gly Pro Ala Ser Tyr Ser Gly Pro Lys Pro Ser Thr Gin 

200 205 210 

Tyr Gly Ala Pro Gly Pro Phe Ala Ala Pro Gly Glu Gly Gly Ala 

215 220 225 

Leu Ala Ala Thr Gly Arg Pro Pro Leu Leu Pro Thr Arg Ala Ser 

230 235 240 

Arg Ser Gin Arg Ala Ala Ser Glu Asp Met Thr Ser Asp Glu Glu 

245 250 255 

Arg Met Val lie Cys Glu Glu Glu Gly Asp Asp Asp Val He Ala 

260 265 270 

Asp Asp Gly Phe Gly Pro Thr Asp Leu Asp Leu Lys Cys Lys Glu 

275 280 285 

Arg Val Thr Asp Ser Glu Ser Gly Asp Ser Ser Gly Glu Asp Pro 

290 295 m 300 

Glu Gly Asn Lys Gly Phe Gly Arg Lys Val Phe Ser Pro Val He 

305 310 315 

Arg Ser Ser Phe Thr His Cys Arg Pro Pro Leu Asp Pro Glu Pro 

320 325 330 

Pro Gly Pro Pro Asp Pro Pro Val Ala Phe Gly Lys Gly Tyr Gly 

335 340 345 

Ser Ala Pro Ser Ser Ser Ala Ser Ser Pro Ala Ser Ser Ser Ala 

350 355 360 

Ser Ala Ala Thr Ser Phe Ser Leu Gly Ser Gly Thr Phe Lys Ala 

365 370 375 

Gin Glu Ser Gly Gin Gly Ser Thr Ala Gly Pro Leu Arg Pro Pro 

380 385 390 

Pro Pro Gly Ala Gly Gly Pro Ala Thr Pro Ser Lys Ala Thr Arg 

395 400 405 

Phe Leu Pro Met Asp Pro Ala Thr Phe Arg Arg Lys Arg Pro Glu 

410 415 420 

Ser Val Gly Gly Leu Glu Pro Pro Gly Pro Ser Val He Ala Ala 

425 430 435 

Pro Pro Ser Gly Gly Gly Asn He Leu Gin Thr Leu Val Leu Pro 



58/69 



WO 01/62922 



PCTYUS01/05896 



440 445 450 



Pro 


Asn 


Lys Glu 


Glu 


Gin 


Glu 


Gly Gly 


Gly 


Ala 


Arg 


Val Pro Ser 








455 










460 






465 


Ala 


Pro 


Ala Pro 


Ser 


Leu 


Ala 


Tyr Gly 


Ala 


Pro 


Ala 


Ala Pro Leu 








470 










475 






480 


Ser 


Arg 


Pro Ala 


Ala 


Thr 


Met 


Val 


Thr 


Asn 


Val 


Val 


Arg Pro Val 








485 










490 






495 


Ser 


Ser 


Thr Pro 


Val 


Pro 


lie 


Ala 


Ser 


Lys 


Pro 


Phe 


Pro Thr Ser 








500 










505 






510 


Gly 


Arg 


Ala Glu 


Ala 


Ser 


Pro 


Asn Asp 


Thr 


Ala 


Gly 


Ala Arg Thr 








515 










520 






525 


Glu 


Met 


Gly Thr 


Gly 


Ser 


Arg 


Val 


Pro 


Gly 


Gly 


Ser 


Pro Leu Gly 








53 0 










535 






540 


Val 


Ser 


Leu Val 


Tyr 


Ser 


Asp 


Lys 


Lys 


Ser 


Ala 


Ala 


Ala Thr Ser 








545 








550 






555 


Pro 


Ala 


Pro His 


Leu 


Val 


Ala 


Gly Pro 


Leu 


Leu 


Gly 


Thr Val Gly 








560 










565 






570 


Lys 


Ala 


Pro Ala 


Thr 


Val 


Thr 


Asn 


Leu 


Leu 


Val 


Gly 


Thr Pro Gly 








575 










580 






585 


Tyr 


Gly 


Ala Pro 


Ala 


Pro 


Pro 


Ala 


Val 


Gin 


Phe 


He 


Ala Gin Gly 








590 










595 






600 


Ala 


Pro 


Gly Gly 


Gly 


Thr 


Thr 


Ala Gly 


Ser 


Gly 


Ala 


Gly Ala Gly 








605 










610 






615 


Ser 


Gly 


Pro Asn 


Gly 


Pro 


Val 


Pro 


Leu 


Gly 


lie 


Leu 


Gin Pro Gly 








620 










625 






630 


Ala 


Leu 


Gly Lys 


Ala 


Gly 


Gly 


lie 


Thr 


Gin 


Val 


Gin 


Tyr He Leu 








635 










640 






645 


Pro 


Thr 


Leu Pro 


Gin 


Gin 


Leu 


Gin 


Val 


Ala 


Pro 


Ala 


Pro Ala Pro 








650 










655 






660 


Ala 


Pro 


Gly Thr 


Lys 


Ala 


Ala 


Ala 


Pro 


Met 


Arg 


Pro 


Cys Thr His 








665 










670 






675 


His 


Gin 


His Pro 


Phe 


His 


Pro 


Pro 


Thr 


Gly 


His 


Phe 


His Gin Arg 








680 










685 






690 


Gin 


Ser 


Pro Gly 


Cys 


His 


Cys 


Thr 


His 


Ser 


Trp 


His 


Pro His Pro 








695 










700 






705 


Ala 


Val 


Cys Thr 


Leu 


Arg 


Pro 


Thr 


Pro 


Gin 


Ser 


Pro 


Val Ser Phe 






710 








715 






720 


Ser 


Arg 


Ala Gly 


Pro 


Ala 


Pro 


Gly Trp 


Leu 


Ser 


Pro 


Ala Ala Ala 








725 










730 






735 


Trp 


Glu 


Gly Pro 


Ser 


Ala 


Ser 


Gly Arg 


Pro 









740 745 

<210> 81 
<211> 256 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc feature 

<223> Incyte ID No: LI : 334386 . 1 . orf 3 : 2000MAY01 
<400> 81 



Leu 


Ala 


Met 


Lys 


Asp 


Met 


Leu 


Thr Val 


Val Asp 


Leu Leu Leu Glu 


1 








5 








10 


15 


Gly 


Gly 


Ala 


Asp 


Val 


Asp 


His 


Thr Asp 


Asn Asn 


Gly Arg Thr Pro 










20 








25 


30 


Leu 


Leu 


Ala 


Ala 


Ala 


Ser 


Met 


Gly His 


Ala Ser 


Val Val Asn Thr 










35 






40 


45 


Leu 


Leu 


Phe 


Trp 


Gly 


Ala 


Ala 


Val Asp 


Ser He 


Asp Ser Glu Gly 










50 








55 


60 


Arg 


Thr 


Val 


Leu 


Ser 


He 


Ala 


Ser Ala 


Gin Gly 


Asn Val Glu Val 










65 








70 


75 


Val 


Arg 


Thr 


Leu 


Leu 


Asp 


Arg 


Gly Leu 


Asp Glu 


Asn His Arg Asp 










80 








85 


90 


Asp 


Ala 


Gly 


Trp 


Thr 


Pro 


Leu 


His Met 


Ala Ala 


Phe Glu Gly His 








95 








100 


105 


Arg 


Leu 


He 


Cys 


Glu 


Ala 


Leu 


He Glu 


Gin Gly 


Ala Arg Thr Asn 
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110 








115 






120 


GlU 


lie 


Asp Asn 


Asp 


Gly 


Arg 


He Pro 


Phe 


He 


Leu Ala Ser 


Gin 








125 








130 






135 


GlU 


Gly 


His Tyr 


Asp 


Cys 


Val 


Gin He 


Leu 


Leu 


Glu Asn Lys 


Ser 








140 








145 






150 


Asn 


He 


Asp Gin 


Arg 


Gly 


Tyr 


Asp Gly 


Arg 


Asn 


Ala Leu Arg 


Val 








155 








160 






165 


Ala 


Ala 


Leu Glu 


Gly 


His 


Arg 


Asp He 


Val 


Glu 


Leu Leu Phe 


Ser 








17 0 




175 






180 


His 


Gly 


Ala Asp 


Val 


Asn 


Cys 


Lys Asp 


Ala 


Asp 


Gly Arg Pro 


Thr 








185 








190 






195 


Leu 


Tyr 


He Leu 


Ala 


Leu 


Glu 


Asn Gin 


Leu 


Thr 


Met Ala Glu 


Tyr 








200 








205 






210 


Phe 


Leu 


Glu Asn 


Gly 


Ala 


Asn 


Val Glu 


Ala 


Ser 


Asp Ala Glu 


Gly 








215 








220 






225 


Arg 


Thr 


Ala Leu 


His 


Val 


Ser 


Cys Trp 


Gin 


Gly 


His Met Gly 


Asn 








230 








235 






240 


Gly 


Ala 


Gly Pro 


Asp 


Ser 


He 


Pro Cys 


Arg 


Arg 


Gin Cys Cys 


Arg 



245 250 255 

Gin 



<210> 82 

<211> 235 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 3 47 572 . 1 . orf 1 : 2000MAY01 
<400> 82 



Met 


Pro 


He 


Leu Pro 


He 


Ser Val Gin Leu 


Asp Ala Ser Leu 


Leu 


1 






5 




10 




15 


He 


Cys 


Leu 


Val He 


Cys 


Ala Gly Arg Phe 


Trp Thr Asn Leu 


Tyr 








20 




25 




30 


Ser 


Leu 


Thr 


Val Pro 


Phe 


Gly Gin Lys Pro 


Asn He Asp Val 


Thr 








35 




40 




45 


Asp 


Ala 


Met 


Val Asp 


Gin 


Ala Trp Asp Ala 


Gin Arg He Phe 


Lys 








50 




55 




60 


Glu 


Ser 


Ala 


Glu Leu 


Leu 


Cys lie Cys Trp 


Ser Ser Leu Tyr 


Asp 








65 




70 




75 


Ser 


Arg 


He 


Leu Arg 


Gin 


He Pro Cys Tyr 


Thr Asp Pro Gly 


Asn 








80 




85 




90 


Val 


Gin 


Lys 


Ala Leu 


Cys 


His Pro His Ser 


Leu Gly Pro Gly 


Glu 








95 




100 




105 


Gly 


Arg 


Leu 


Gin Arg 


Ser 


Leu Cys Ala Gin 


Arg Val Thr Met 


Asp 








110 




115 




120 


Asp 


Phe 


Leu 


Thr Ala 


His 


His Glu Met Gly 


His He Gin Tyr 


Asp 








125 




13 0 




135 


Met 


Ala 


Tyr 


Ala Gly 


Gin 


Pro Phe Ser Ala 


Lys Glu Met Glu 


Leu 








140 




145 




150 


Asn 


Glu 


Gly 


Phe His 


Glu 


Ala Val Gly Glu 


He Met Ser Leu 


Ser 








155 




160 




165 


Ala 


Ala 


Thr 


Pro Lys 


His 


Leu Lys Ser He 


Gly Leu Leu Ser 


Pro 








170 




175 




180 


Glu 


Phe 


Ser 


Thr Asn 


Asp 


Asn Glu Thr Glu 


He Asn Phe Leu 


Leu 








185 


190 




195 


Lys 


Gin 


Ala 


Leu Thr 


He 


Val Gly Thr Leu 


Pro Phe Thr Tyr 


Met 








200 




205 




210 


Leu 


Glu 


Lys 


Trp Arg 


Trp 


Met Val Phe Lys 


Arg Gly Asn Ser 


Gin 








215 




220 




225 


Arg 


Pro 


Val 


Gly Glu 


Lys 


Gly Gly Gly Arg 












230 




235 







<210> 83 
<211> 617 
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<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 817314 . 1 .orfl : 200 0MAY01 
<400> 83 



Asn 


Met 


Ala 


Gin 


Phe 


Tyr 


Tyr 


Lys Arg 


Asn 


Val 


Asn 


Ala Pro 


Tyr 


1 








5 










10 








15 


Arg 


Asp 


Arg 


He 


Pro 


Leu 


Arg 


He 


Val 


Arg 


Ala 


Glu 


Ser Glu 


Leu 










20 










25 








30 


Ser 


Pro 


Ser 


Glu 


Lys 


Ala 


Tyr 


Leu 


Asn 


Ala 


Val 


Glu Lys Gly 


Asp 
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45 


Tyr 


Ala 


Ser 


Val 


Lvs 
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Ser 


Leu Glu 


Glu 


Ala 


Glu 


He Tyr 


Phe 
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60 


LVS 


lie 


Asn 


He 


Asn 


Cvs 

J: 


He 


Asp 


Pro 


Leu 


Glv 


Arg Thr Ala 


Leu 










65 








70 
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Leu 


He 


Ala 


He 


Glu 


Asn 


Glu 


Asn 


Leu 


Glu 


Leu 


He 


Glu Leu 


Leu 
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Leu 


Ser 


Phe 


Asn 


Val 


Tvr 


Val 


Gly Asp 


Ala 


Leu 


Leu 


His Ala 


He 










95 
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Arg 


Lys 


Glu 


Val 


Val 


Gly 


Ala 


Val 


Glu 


Leu 


Leu 


Leu 


Asn His 


Lvs 
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Lys 


Pro 


Ser 


Glv 


Glu 


Lys 


Gin 


Val 


Pro 


Pro 


He 


Leu 


Leu Asp 


Lys 
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130 
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Gin 


Phe 
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Glu 


Phe 


Thr 


Pro 


Asp 


He 


Thr 


Pro 


He 


He Leu 


Ala 
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Ala 


His 


Thr 


Asn 


Asn 


Tvr 


Glu 


lie 


He 


Lys 


Leu 


Leu 


Val Gin 


Lvs 
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160 








165 


Glv 


Val 


Ser 


Val 


Pro 


Arg 


Pro 


His 


Glu 


Val 


Arcr 


Cys 


Asn Cys 


Val 










170 










175 








180 


Glu 


Cvs 


Val 


Ser 


Ser 


Ser 


Asp 


Val 


Asp 


Ser 


Leu 


Arg 


His Ser 


Arg 










185 










190 








195 


Ser 


Arg 


Leu 


Asn 


He 


Tvr 


Lvs 


Ala 


Leu 


Ala 


Ser 


Pro 


Ser Leu 


He 










200 






205 








210 


Ala 


Leu 


Ser 


Ser 


Glu 


Asp 


Pro 


Phe 


Leu 


Thr 


Ala 


Phe 


Gin Leu 


Ser 










215 








220 








225 


Trp 


Glu 


Leu 


Gin 


Glu 


Leu 


Ser 


Lys 


Val 


Glu 


Asn 


Glu Phe Lys 


Ser 










230 










235 








240 


Glu 


Tvr 


GlU 


Glu 


Leu 


Ser 


Arcr 


Gin Cys 


Lvs 


Gin 


Phe Ala Lys 


Asp 










245 










250 








255 


Leu 


Leu 


Asp 


Gin 


Thr 


Ara 


Ser 


Ser Arg 


Glu 


Leu 


Glu 


He He 


Leu 
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Asn 


Tyr 


Arg 


Asp 


Asp 


Asn 


Ser 


Leu 


He 


Glu 


Glu 


Gin 


Ser Gly 


Asn 
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280 








285 


Asp 


Leu 


Ala 


Arcr 


Leu 


Lvs 


Leu 


Ala 


He 


Lvs 


Tvr 


Arg Gin Lys 


Glu 
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Phe 


Val 


Ala 


Gin 


Pro 


Asn 


Cys 


Gin 


Gin 


Leu 


Leu 


Ala 


Ser Arg 


Trp 










305 










310 








315 


Tyr 


Asp 


Glu 


Phe 


Pro 


Gly 


Trp 


Arg 


Arg 


Arg 


His 


Trp 


Ala Val 


Lys 










320 










325 








330 


Met 


Val 


Thr 


Cys 


Phe 


He 


He 


Gly Leu 


Leu 


Phe 


Pro 


Val Phe 


Ser 
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340 








345 


Val 


Cys 


Tyr 


Leu 


He 


Ala 


Pro 


Lys 


Ser 


Pro 


Leu 


Gly Leu Phe 


He 










350 








355 








360 


Arg 


Lys 


Pro 


Phe 


He 


Lys 


Phe 


He 


Cys 


His 


Thr 


Ala 


Ser Tyr 


Leu 










365 










370 








375 


Thr 


Phe 


Leu 


Phe 


Leu 


Leu 


Leu 


Leu 


Ala 


Ser 


Gin 


His 


He Asp 


Arg 
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385 
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Ser 


Asp 


Leu 


Asn 


Arg 


Gin 


Gly 


Pro 


Pro 


Pro 


Thr 


He 


Val Glu 


Trp 
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400 








405 


Met 


He 


Leu 


Pro 


Trp 


Val 


Leu 


Gly Phe 


He 


Trp 


Gly Glu He 


Lys 
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415 
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Gin 


Met 


Trp 


Asp 


Gly 


Gly 


Leu 


Gin Asp 


Tyr 


He 


His Asp Trp 


Trp 
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435 


Asn 


Leu 


Met 


Asp 


Phe 


Val 


Met 


Asn 


Ser 


Leu 


Tyr 


Leu 


Ala Thr 


lie 
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440 


Ser 


Leu 


Lys 


He Val Ala Phe Val Lys 










455 


Arg 


Glu 


Ser 


Trp Asp Met Trp His Pro 










470 


Leu 


Phe 


Ala 


He 


Ala Asn lie Phe Ser 










485 


Leu 


Phe 


Thr 


Ala 


Asn Ser His Leu Gly 










500 


Gly 


Arg 


Met 


Leu 


Leu Asp He Leu Lys 










515 


Leu 


Val 


Leu 


Leu 


Ala Phe Ala Asn Gly 










53 0 


Tyr 


Tyr 


Glu 


Glu 


Thr Lys Gly Leu Thr 










545 


Glu 


Lys 


Gin 


Asn 


Asn Ala Phe Ser Thr 










560 


Ser 


Leu 


Phe 


Trp 


Ser He Phe Gly Leu 








575 


Asn 


Val 


Lys 


Ala 


Gin His Glu Phe Thr 










590 


Leu 


Phe 


Gly 


Asp 


He Thr Met Ser Ser 






605 


Thr 


Cys 
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445 




450 


Tyr 


Ser 


Ala Leu Asn Pro 


460 




465 


Thr 


Leu 


Val Ala Glu Ala 


475 




480 


Ser 


Leu 


Arg Leu He Ser 


490 




495 


Pro 


Leu 


Gin He Ser Leu 


505 




510 


Phe 


Leu 


Phe He Tyr Cys 


520 




525 


Leu 


Asn 


Gin Leu Tyr Phe 


535 




540 


Cys 


Lys 


Gly He Arg Cys 


550 




555 


Leu 


Phe 


Glu Thr Leu Gin 


565 




570 


He 


Asn 


Leu Tyr Val Thr 


580 




585 


Glu 


Phe 


Val Gly Ala Thr 


595 




600 


Leu 


Trp 


Leu Phe Tyr Ser 


610 




615 



<210> 84 
<211> 293 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 000290 . 1 . orf 3 : 2000MAY01 
<400> 84 

Gly Ala His Ala Lys Thr Gly He Gin He Gly Met Leu Ser Thr 
1 5 10 15 

Gly Lys Asp Arg Ser Leu Arg Val Thr Gly Met Thr Trp Arg Ser 
20 25 30 

Ser Tyr Val Pro Val Ser Ala Pro Pro Pro Asn Ser Ser Glu Gin 
35 40 45 

Tyr Ser Ser Gly Ala Gin Ser He Pro Ser Thr Val Thr Val He 
50 55 60 

Ala Pro Trp Ser Pro Thr Leu Glu Asn Thr Thr Trp Glu Leu Val 
65 70 75 

Leu Leu Leu Leu Lys He He Ser Ser Ser Asn Ser Phe Gly Arg 
80 85 90 

Asn Leu Pro Pro Lys Arg Arg Cys Arg Asp Tyr Asp Glu Arg Gly 
95 100 105 

Phe Cys Val Leu Gly Asp Leu Cys Gin Phe Asp His Gly Asn Asp 
110 ~ 115 120 

Pro Leu Val Val Asp Glu Val Ala Leu Pro Ser Met He Pro Phe 
125 130 135 

Pro Pro Pro Pro Pro Gly Leu Pro Pro Pro Thr Thr Pro Gly Met 
140 145 150 

Leu Met Pro Pro Met Pro Gly Pro Gly Pro Gly Pro Gly Pro Gly 
155 160 1*65 

Pro Gly Pro Gly Pro Gly Pro Gly Pro Gly Pro Gly His Ser Met 
170 175 180 

Arg Leu Pro Val Pro Gin Gly His Gly Gin Pro Pro Pro Ser Val 
185 190 195 

Val Leu Pro He Pro Arg Pro Pro He Thr Gin Ser Ser Leu lie 
200 205 210 

Asn Ser Arg Asp Gin Pro Gly Thr Ser Ala Val Pro Asn Leu Ala 
215 220 225 

Ser Val Gly Thr Arg Leu Pro Pro Pro Leu Pro Gin Asn Leu Leu 
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230 235 
Tyr Thr Val Ser Glu Arg Gin Pro Met Tyr Ser Arg Glu His 

245 250 
Ala Ala Ala Ser Glu Arg Leu Gin Leu Gly Thr Pro Pro Pro 

260 265 
Leu Ala Ala Arg Leu Val Pro Pro Arg Asn Leu Met Gly Ser 

275 280 
lie Gly Tyr His Thr Ser Val Ser 

290 

<210> 85 

<211> 276 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 023518 . 3 .orf 3 : 2000MAY01 
<400> 85 



Leu 


Ser 


Pro 


Asp 


Arg 


Leu 


Leu 


Val Leu Pro 


Asp 


Asn Tyr Ser 


His 


1 








5 






10 






15 


Phe 


Ser 


Gin 


Ala 


Ser 


Ala 


Asn 


Leu Gin Gly 


Pro 


Ser Arg Thr 


Thr 










20 






25 






30 


Glu 


Leu 


Phe 


His 


Pro 


Thr 


Leu 


Ala Ser lie 


Ser 


Ser Pro Met 


Leu 










35 






40 






45 


Glu 


Gly 


Ala 


Glu 


Leu 


Tyr 


Phe 


Asn Val Asp 


His 


Gly Tyr Leu 


Glu 










50 






55 






60 


Gly 


Leu 


Val 


Arg 


Gly 


Cys 


Lys 


Ala Ser Leu 


Leu 


Thr Gin Gin 


Asp 










65 






70 






75 


Tyr 


He 


Asn 


Leu 


Val 


Gin 


Cys 


Glu Thr Leu 


Glu 


Ala Pro Phe 


Phe 










80 






85 






90 


Gin 


Asp 


Cys 


Met 


Ser 


Glu 


Asn 


Ala Leu Asp 


Glu 


Leu Asn He 


Glu 






95 






100 






105 


Leu 


Leu 


Arg 


Asn 


Lys 


Leu 


Tyr 


Lys Ser Tyr 


Leu 


Glu Ala Phe 


Tyr 










110 






115 






120 


Lys 


Phe 


Cys 


Lys 


Asn 


His 


Gly 


Asp Val Thr 


Ala 


Glu Val Met 


Cys 










125 






130 






135 


Pro 


He 


Leu 


Glu 


Phe 


Glu 


Ala 


Asp Arg Arg 


Ala 


Phe He He 


Thr 










140 






145 






150 


Leu 


Asn 


Ser 


Phe 


Gly 


Thr 


Glu 


Leu Ser Lys 


Glu 


Asp Arg Glu 


Thr 










155 






160 






165 


Leu 


Tyr 


Pro 


Thr 


Phe 


Arg 


Gin 


Leu Tyr Pro 


Glu 


Gly Leu Arg 


Leu 










170 






175 






180 


Leu 


Ala 


Gin 


Ala 


Glu 


Asp 


Phe 


Asp Gin Met 


Lys 


Asn Val Ala 


Asp 










185 




190 






195 


His 


Tyr 


Gly Val 


Tyr 


Lys 


Pro 


Leu Phe Glu 


Ala 


Val Gly Gly 


Ser 








200 




205 






210 


Gly 


Gly 


Lys 


Thr 


Leu 


Glu Asp 


Val Phe Tyr 


Glu 


Arg Glu Val 


Gin 










215 






220 






225 


Met 


Asn 


Val 


Leu 


Ala 


Phe 


Asn 


Arg Gin Phe 


His 


Tyr Gly Val 


Phe 










230 






235 






240 


Tyr 


Ala 


Tyr 


Val 


Lys 


Leu 


Lys 


Glu Gin Glu 


He 


Arg Asn He 


Val 










245 




250 






255 


Trp 


He 


Ala 


Glu 


Cys 


He 


Ser 


Gin Arg His 


Arg 


Thr Lys He 


Asn 










260 






265 






270 


Ser 


Tyr 


He 


Pro 


He 


Leu 




















275 















<210> 86 

<211> 355 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 1084246 . 1 , orf3 : 2000MAY01 



240 
Gly 
255 
Leu 
270 
Ser 
285 
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<400> 86 



Pro Leu 


Asp Arg 


GlU 


Thr Ser 


Thr Glu Tyr 


Asn He Thr He 


Ala 


1 


5 






10 




15 


Val Thr 


Asp Leu 


Gly 


Thr Pro 


Arg Leu Lys 


Thr Gin Gin Asn 


He 




20 






25 




30 


Thr Val 


Gin Val 


Ser 


Asp Val 


Asn Asp Asn 


Ala Pro Ala Phe 


Thr 






35 




40 




45 


Gin Thr 


Ser Tyr 


Thr 


Leu Phe 


Val Arg Glu 


Asn Asn Ser Pro 


Ala 




50 






55 




60 


Leu His 


He Gly 


Ser 


Val Ser 


Ala Thr 


Asp 


Arg Asp Ser Gly 


Thr 




65 






70 




75 


Asn Ala 


Gin Val 


Thr 


Tyr Ser 


Leu Leu 


Pro 


Pro Gin Asp Pro 


His 






80 




85 




90 


Leu Pro 


Leu Ala 


Ser 


Leu Val 


Ser He 


Asn 


Ala Asp Asn Gly 


His 






95 






100 




105 


Leu Phe 


Ala Leu 


Arg 


Ser Leu 


Asp Tyr 


Glu 


Ala Leu Gin Ala 


Phe 






110 






115 




120 


Glu Phe 


Arg Val 


Gly 


Ala Ser 


Asp Arg Gly 


Ser Pro Ala Leu 


Ser 






125 






130 




135 


Ser Glu 


Ala Leu 


Val 


Arg Val 


Leu Val 


Leu 


Asp Thr Asn Asp 


Asn 






140 




145 




150 


Ser Pro 


Phe Val 


Leu 


Tyr Pro 


Leu Gin 


Asn 


Gly Ser Ala Pro 


Cys 






155 




160 




165 


Thr Glu 


Leu Val 


Pro 


Arg Ala 


Ala Glu 


Pro 


Gly Tyr Leu Val 


Thr 






170 




175 




180 


Lys Val 


Val Ala 


Val 


Asp Gly 


Asp Ser Gly 


Gin Asn Ala Trp 


Leu 




185 






190 




195 


Ser Tyr 


Gin Leu 


Leu 


Lys Ala 


Thr Glu 


Pro 


Gly Leu Phe Gly 


Val 




200 




205 




210 


Trp Ala 


His Asn 


Gly 


Glu Val 


Arg Thr Ala 


Arg Leu Leu Ser 


GlU 






215 






220 




225 


Arg Asp 


Ala Ala 


Lys 


His Arg 


Leu Val 


Val 


Leu Val Lys Asp 


Asn 






230 






235 




240 


Gly Glu 


Pro Pro 


Arg 


Ser Ala 


Thr Ala 


Thr 


Leu His Val Leu 


Leu 




245 






250 




255 


Val Asp 


Gly Phe 


Ser 


Gin Pro 


Tyr Leu 


Pro 


Leu Pro Glu Ala 


Ala 


260 




265 




270 


Pro Ala 


Gin Ala 


Gin 


Ala Asp 


Ser Leu 


Thr 


Val Tyr Leu Val 


Val 






275 




280 




285 


Ala Leu 


Ala Ser 


Val 


Ser Ser 


Leu Phe 


Leu 


Phe Ser Val Leu 


Leu 






290 






295 




300 


Phe Val 


Ala Val 


Arg 


Leu Cys 


Arg Arg 


Ser 


Arg Ala Ala Ser 


Val 






305 






310 




315 


Gly Arg 


Cys Ser 


Val 


Pro Glu 


Gly Pro 


Phe 


Pro Gly His Leu 


Val 






320 






325 




330 


Asp Val 


Ser Gly 


Thr 


Gly Thr 


Leu Ser 


Gin 


Glu Leu Pro Val 


Arg 


335 




340 




345 


Gly Val 


Ser Asp 


Arg 


Arg Leu 


Trp Asp Trp 










350 






355 







<210> 87 

<211> 745 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 1165828 . 1 .or f 2 :2000MAY01 
<400> 87 

Val Phe Glu Met Leu Tyr Ser Ser Arg Gly Asp Pro Glu Gly Gin 
15 10 15 

Pro Leu Leu Leu Ser Leu Leu He Leu Ala Met Trp Val Val Gly 

20 25 30 

Ser Gly Gin Leu His Tyr Ser Val Pro Glu Glu Ala Glu His Gly 

35 40 45 

Thr Phe Val Gly Arg He Ala Gin Asp Leu Gly Leu Glu Leu Ala 
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50 55 60 
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